Abstract
Adherent-invasive Escherichia coli (AIEC) have been extensively implicated in Crohn’s disease pathogenesis. Currently, AIEC is identified phenotypically, since no molecular marker specific for AIEC exists. An algorithm based on single nucleotide polymorphisms was previously presented as a potential molecular tool to classify AIEC/non-AIEC, with 84% accuracy on a collection of 50 strains isolated in Girona (Spain). Herein, our aim was to determine the accuracy of the tool using AIEC/non-AIEC isolates from different geographical origins and extraintestinal pathogenic E. coli (ExPEC) strains. The accuracy of the tool was significantly reduced (61%) when external AIEC/non-AIEC strains from France, Chile, Mallorca (Spain) and Australia (82 AIEC, 57 non-AIEC and 45 ExPEC strains in total) were included. However, the inclusion of only the ExPEC strains showed that the tool was fairly accurate at differentiating these two close pathotypes (84.6% sensitivity; 79% accuracy). Moreover, the accuracy was still high (81%) for those AIEC/non-AIEC strains isolated from Girona and Mallorca (N = 63); two collections obtained from independent studies but geographically close. Our findings indicate that the presented tool is not universal since it would be only applicable for strains from similar geographic origin and demonstrates the need to include strains from different origins to validate such tools.
Similar content being viewed by others
Introduction
The involvement of the adherent-invasive Escherichia coli (AIEC) pathotype in Crohn’s disease (CD) pathogenesis has been extensively supported, as many researchers have reported higher AIEC prevalence in CD patients than controls1,2,3,4,5,6,7,8,9, and mechanisms of pathogenicity have been linked with CD pathophysiology10,11,12,13,14,15,16,17. The ability to adhere to and invade intestinal epithelial cells, as well as, to survive and replicate inside macrophages are key characteristics of AIEC strains2. No gene or sequence exclusive to the AIEC pathotype has been identified, and AIEC identification currently remains challenging; the only way to identify an AIEC strain is by assessing bacterial infection in cell culture assays which are non-standardised and highly time-consuming2.
AIEC strains isolated to date are clonally diverse and belong to distinct serotypes. Although AIEC primarily fall into the B2 phylogroup, AIEC strains belonging to the A, B1, and D phylogroups have also been isolated1,3,4,6,9,18,19,20,21,22,23. In terms of virulence genes, AIEC resemble extraintestinal pathogenic E. coli (ExPEC), which are mostly non-invasive and the majority of them do not behave like AIEC3,24,25,26, with the exception of some isolates26,27.
Up to now, six genetic elements (pduC, lpfA, lpfA + gipA, chuA, 29 point mutations and 3 genomic regions) have been suggested as putative AIEC molecular markers6,21,23,28,29, however they either present low sensitivity or have been studied in a small number of strains. In a previous study conducted in our research group30, we designed a classification algorithm based on the identification of the nucleotides present in three Single Nucleotide Polymorphisms (SNPs). This algorithm displayed 82.1% specificity, 86.4% sensitivity and 84.0% accuracy within our Spanish strain collection. Given the high genotypic variability of AIEC, our aim was to validate the tool previously presented in AIEC/non-AIEC strains from distant geographical origins and ExPEC strains in order to assess the usefulness of these SNPs as molecular signatures for AIEC screening in external collections.
Results
Confirmation of the validity of the algorithm30 in additional geographically distant AIEC/non-AIEC and ExPEC strains was performed.
When all AIEC/non-AIEC strains from Girona, Mallorca, France, Chile and Australia, as well as ExPEC strains were analysed, 73/98 of the non-AIEC strains were correctly classified but only 39/86 of the AIEC strains were appropriately predicted, resulting in a high probability of obtaining false negatives (54.6%). Therefore, in comparison to the values obtained within our strain collection (82.1% specificity, 86.4% sensitivity and 84.0% accuracy), the global accuracy was significantly reduced (60.9%), with decreased specificity (74.5%) and especially lower sensitivity (45.4%) (Table 1, Fig. 1). In contrast to the previous study30, the SNPs that were found to be differentially distributed among our AIEC and non-AIEC strains (E3-E4_4.4 and E5-E6_3.16 = 3.22(2)) showed similar frequencies according to phenotype when all the strains were considered (Table 2). According to the algorithm30, strains displaying guanine (G) in SNP E3-E4_4.4 are classified as non-AIEC, and the same occurs for those that do not have the gene (−) where SNP E3-E4_4.4 is located and display a nucleotide other than G at SNP E5-E6_3.16 = 3.22(2). Indeed, most AIEC strains (54.6%) were incorrectly classified because they accomplished these conditions (Fig. 2). Other possible SNP combinations were considered for all the strains included in the study but none improved the precision of the algorithm.
Despite global accuracy of the algorithm being much lower when all strains were considered, the method was suitable for geographically close strain collections. Indeed, if only Spanish strains (Girona and Mallorca) (N = 63) were considered, the accuracy of the tool was maintained (80.9%) (Table 1). Specificity was also good (82.3%), meaning there was a low probability of false positives (17.7%) (Fig. 1). Therefore, strains from different laboratory collections, but of similar geographical origin, were suitable for screening by this method.
The inclusion of ExPEC strains (N = 45) revealed that the tool was also useful for distinguishing the ExPEC and AIEC pathotypes, since 84.6% of strains displaying the AIEC phenotype were correctly classified, with a global accuracy of 78.9% (Table 1, Fig. 1).
These results demonstrated that the classification algorithm presented has limited applicability for all E. coli strains assessed. However, this novel molecular tool showed promising results for Spanish AIEC and ExPEC strains.
Discussion
The identification of molecular tools or rapid tests to easily identify the AIEC pathotype would be of great interest to scientists studying the epidemiology of the pathotype, as well as clinicians hoping to detect which patients are colonised by AIEC to apply personalised treatments. Although several studies have been conducted with this aim in mind, there is still no molecular signature specific to AIEC6,21,23,28,29.
In a previous study we performed comparative genomics of three AIEC/non-AIEC clone pairs and presented a classification algorithm that combines three SNPs, allowing for the classification of phylogenetically and phenotypically diverse E. coli isolates with a high accuracy rate in our strain collection30. Since the application of a molecular tool could assist in overcoming the problem of AIEC identification, we further tested the specificity and sensitivity of the tool in additional geographically distant and phylogenetically diverse AIEC strains, as well as ExPEC strains, which share genetic and phenotypic features3,24,25,26.
The tool was found to be accurate enough to distinguish between AIEC and ExPEC strains, since the sensitivity was 84.6% and the accuracy was 78.9%. In this case, we assessed both AIEC/non-AIEC from Girona (Spain) and ExPEC strains, the latter being mostly Spanish isolates. These results indicated that for a given geographic origin this algorithm could be applied to differentiate ExPEC from AIEC. So far, most of the studies looking for AIEC biomarkers have not included ExPEC strains in their analysis6,21,23,28. There is only one that focused on synonymous and non-synonymous SNPs along the genome of four B2-AIEC strains that could differentiate them from other B2-non-AIEC and B2-ExPEC genomes available in databases. Although they found 29 SNPs that could separate AIEC from non-AIEC using a bioinformatics approach, but did not include the three SNPs in the presented algorithm, it did not find a signature sequence that distinguishes AIEC from ExPEC29. It is not possible to determine whether the high accuracy value we reported is due to similar geographic origin (40 from Spain and 5 USA) or not. Thus the inclusion of other ExPEC strains would be needed to validate the tool further. Unfortunately, the predicted values of the tool decreased considerably (60.9% of accuracy) when strains across several geographic regions were considered. AIEC isolates from France, Chile and Australia were poorly discriminated with the SNP algorithm presented, resulting in significantly reduced sensitivity values (32.3, 0 and 15.4% respectively). Of note, this algorithm may be suitable for Spanish strains, because the accuracy was still high when two different collections of strains were studied (Girona and Mallorca) (80.9% accuracy). Taking into account that the variable gene content of E. coli is highly variable across different geographic regions31, this variation contributes to the algorithm not being applicable across geographically diverse regions and it is subjected to possible variations in the accuracy presented in a particular country.
In conclusion, the molecular tool that we previously proposed30 is not universal since its accuracy was reduced to 60.9% once a larger strain collection from different geographic locations and pathotypes was screened. We suspect it might be a good discrimination tool for a particular geographic location, in this case Spain. However, this observation should be confirmed with the addition of other Spanish strain collections including AIEC, non-AIEC, and other E. coli intestinal and extraintestinal pathotypes. The study of new SNPs that could be useful to distinguish between AIEC/non-AIEC strains from different geographical origins might be time-consuming and unprofitable and should consider many aspects that make it even more complicated (for example, the moment of strain isolation and the patient’s treatment). Therefore, we believe that new approaches (e.g. transcriptomics, metabolomics or epigenetics) should be applied to find a universal AIEC biomarker that could be used as a rapid standardised method for detecting AIEC from E. coli isolates, or maybe just E. coli isolates that have a strong colonizing ability. Nonetheless, there is a possibility that a no universal marker exists and then it would be interesting to look for a biomarker that englobes the majority of AIEC strains32. In any case, this work highlighted the importance of validating putative molecular markers in a diverse strain collection, in terms of geographic origin and pathotype, in order to assess whether or not it could be used universally.
Methods
The SNPs included in the algorithm (E3-E4_4.4, E5-E6_3.16 = 3.22(2) and E5-E6_3.12) were screened by PCR and Sanger sequencing. Primers and PCR conditions are indicated in Table 3. Apart from the strains assessed in the previous study (22 AIEC and 28 non-AIEC, which includes LF82 strain)30, this collection comprised 60 AIEC and 29 non-AIEC strains mainly isolated from CD patients and controls from distinct geographical origin (Spain (Mallorca)6, Chile6, France and Australia33) (Table S1). Most of these strains were phenotypically characterised in previous studies6,33. The adhesion and invasion indices of 25/33 Australian strains were measured in this study as previously described1,30,34 in order to classify them phenotypically as AIEC or non-AIEC. In addition, 45 strains isolated from patients with extraintestinal diseases were also included; these were previously isolated from American patients with meningitis35, and Spanish patients with sepsis26 or urinary tract infection36 (Table S1). Phenotypic characterisation of these strains was performed by Martinez-Medina et al.26; in which four strains presented the AIEC-phenotype and were considered as such in the analysis and 41 did not (these were classified as non-AIEC).
Strains studied in this study were previously isolated under the approval of the Ethics Committee 183 of Clinical Investigation of the Hospital Josep Trueta of Girona on May 22 2006; Ethical 184 committee of Hospital Saint-Louis (CPP#2009/17); Institutional Review Board of Clínica Las 185 Condes, Faculty of Medicine, Universidad de Chile; Ethics Committee of the Northern 186 Metropolitan Health Service, Santiago, Chile; the Balearic Islands’ Ethical Committee, Spain; and 187 ACT Health Human Research Ethics Committee (ETH.5.07.464). Subjects gave written 188 informed consent in accordance with the Declaration of Helsinki.
The differences in the distribution of nucleotides present in each polymorphic site between phenotype were calculated using the Χ2 test. To establish the usefulness of the algorithm for AIEC identification, the specificity, sensitivity and accuracy values were measured as follows: Sensitivity (%)= (true positives/(true positives + false negatives)) × 100, Specificity (%)= (true negatives/(true negatives + false positives)) x 100; and, Accuracy (%)= ((true positives + true negatives)/(total of cases)) × 100. A p-value ≤ 0.05 was considered statistically significant in all cases.
References
Martinez-Medina, M. et al. Molecular diversity of Escherichia coli in the human gut: New ecological evidence supporting the role of adherent-invasive E. coli (AIEC) in Crohn’s disease. Inflamm. Bowel Dis. 15, 872–882 (2009).
Darfeuille-Michaud, A. et al. High prevalence of adherent-invasive Escherichia coli associated with ileal mucosa in Crohn’s disease. Gastroenterology 127, 412–421 (2004).
Baumgart, M. et al. Culture independent analysis of ileal mucosa reveals a selective increase in invasive Escherichia coli of novel phylogeny relative to depletion of Clostridiales in Crohn’s disease involving the ileum. ISME J. 1, 403–418 (2007).
Sasaki, M. et al. Invasive Escherichia coli are a feature of Crohn’s disease. Lab. Investig. 87, (2007).
Dogan, B. et al. Multidrug resistance is common in Escherichia coli associated with ileal Crohnʼs disease. Inflamm. Bowel Dis. 19, 141–150 (2013).
Céspedes, S. et al. Genetic diversity and virulence determinants of Escherichia coli strains isolated from patients with Crohn’s disease in Spain and Chile. Front. Microbiol. 8, 639 (2017).
Raso, T. et al. Analysis of Escherichia coli isolated from patients affected by Crohn’s Disease. Curr. Microbiol. 63, 131–137 (2011).
Negroni, A. et al. Characterization of adherent-invasive Escherichia coli isolated from pediatric patients with inflammatory bowel disease. Inflamm. Bowel Dis. 18, 913–924 (2012).
Conte, M. P. et al. Adherent-invasive Escherichia coli (AIEC) in pediatric Crohn’s disease patients: phenotypic and genetic pathogenic features. BMC Res. Notes 7, 748 (2014).
Barnich, N. et al. CEACAM6 acts as a receptor for adherent-invasive E. coli, supporting ileal mucosa colonization in Crohn disease. J. Clin. Invest. 117, 1566–74 (2007).
Carvalho, F. a et al. Crohn’s disease adherent-invasive Escherichia coli colonize and induce strong gut inflammation in transgenic mice expressing human CEACAM. J. Exp. Med. 206, 2179–2189 (2009).
Chassaing, B. et al. Crohn disease-associated adherent-invasive E. coli bacteria target mouse and human Peyer’s patches via long polar fimbriae. J. Clin. Invest. 121, 966–75 (2011).
Bringer, M.-A., Barnich, N., Glasser, A.-L., Bardot, O. & Darfeuille-Michaud, A. HtrA stress protein is involved in intramacrophagic replication of adherent and invasive Escherichia coli strain LF82 isolated from a patient with Crohn’s disease. Infect. Immun. 73, 712–21 (2005).
Glasser, A. et al. Adherent invasive Escherichia coli strains from patients with Crohn’s disease survive and replicate within macrophages without inducing host cell death. Infect. Immun. Immun 69, 5529–37 (2001).
Meconi, S. et al. Adherent-invasive Escherichia coli isolated from Crohn’s disease patients induce granulomas in vitro. Cell. Microbiol. 9, 1252–1261 (2007).
Lapaquette, P., Bringer, M.-A. & Darfeuille-Michaud, A. Defects in autophagy favour adherent-invasive Escherichia coli persistence within macrophages leading to increased pro-inflammatory response. Cell. Microbiol. 14, 791–807 (2012).
Nguyen, H. T. T. et al. Crohn’s disease-associated adherent invasive Escherichia coli modulate levels of microRNAs in intestinal epithelial cells to reduce autophagy. Gastroenterology 146, 508–519 (2014).
Martin, H. M. et al. Enhanced Escherichia coli adherence and invasion in Crohn’s disease and colon cancer. Gastroenterology 127, 80–93 (2004).
Masseret, E. et al. Genetically related Escherichia coli strains associated with Crohn’ s disease. Gut 48, 320–325 (2001).
Sepehri, S., Kotlowski, R., Bernstein, C. N. & Krause, D. O. Phylogenetic analysis of inflammatory bowel disease associated Escherichia coli and the FimH virulence determinant. Inflamm. Bowel Dis. 15, 1737–1745 (2009).
Desilets, M. et al. Genome-based definition of an inflammatory bowel disease-associated adherent-invasive Escherichia coli pathovar. Inflamm. Bowel Dis. 22, 1–12 (2016).
Schippa, S. et al. Dominant genotypes in mucosa-associated Escherichia coli strains from pediatric patients with inflammatory bowel disease. Inflamm. Bowel Dis. 15, 661–672 (2009).
Dogan, B. et al. Inflammation-associated adherent-invasive Escherichia coli are enriched in pathways for use of propanediol and iron and M-cell. Inflamm. Bowel Dis. 20, 1919–1932 (2014).
Miquel, S. et al. Complete genome sequence of Crohn’s disease-associated adherent-invasive E. coli strain LF82. PLoS One 5, 1–16 (2010).
Nash, J. H. et al. Genome sequence of adherent-invasive Escherichia coli and comparative genomic analysis with other E. coli pathotypes. BMC Genomics 11, 667 (2010).
Martinez-Medina, M. et al. Similarity and divergence among adherent-invasive Escherichia coli and extraintestinal pathogenic E. coli strains. J. Clin. Microbiol. 47, 3968–79 (2009).
Conte, M. P. et al. The adherent/invasive Escherichia coli strain LF82 invades and persists in human prostate cell line RWPE-1, Activating a Strong Inflammatory Response. Infect. Immun. 84, 3105–3113 (2016).
Vazeille, E. et al. GipA factor supports colonization of Peyer’s Patches by Crohn’s Disease-associated Escherichia coli. Inflamm. Bowel Dis. 22, 68–81 (2016).
Deshpande, N. P., Wilkins, M. R., Mitchell, H. M. & Kaakoush, N. O. Novel genetic markers define a subgroup of pathogenic Escherichia coli strains belonging to the B2 phylogenetic group. FEMS Microbiol. Lett. 1–7, https://doi.org/10.1093/femsle/fnv193 (2015).
Camprubí-Font, C. et al. Comparative genomics reveals new single-nucleotide polymorphisms that can assist in identification of adherent-invasive Escherichia coli. Sci. Rep. 8, 1–11 (2018).
Gordon, D. M. et al. Fine-scale structure analysis shows epidemic patterns of clonalcomplex 95, a cosmopolitan Escherichia coli lineage responsible for extraintestinal infection. mSphere 2, (2017).
Camprubí-Font C, M.-M. M. Why the discovery of adherent-invasive Escherichia coli molecular markers is so challenging? World J Biol Chem In press, (2020).
O’Brien, C. L. et al. Comparative genomics of Crohn’s disease-associated adherent-invasive Escherichia coli. Gut 66, 1382–1389 (2016).
Boudeau, J., Glasser, A. L., Masseret, E., Joly, B. & Darfeuille-Michaud, A. Invasive ability of an Escherichia coli strain isolated from the ileal mucosa of a patient with Crohn’s disease. Infect. Immun. 67, 4499–509 (1999).
Bidet, P. et al. Combined multilocus sequence typing and O serogrouping distinguishes Escherichia coli subtypes associated with infant urosepsis and/or meningitis. J. Infect. Dis. 196, 297–303 (2007).
Blanco, M., Blanco, J. E., Alonso, M. P. & Blanco, J. Virulence factors and O groups of Escherichia coli isolates from patients with acute pyelonephritis, cystitis and asymptomatic bacteriuria. Eur. J. Epidemiol. 12, 191–8 (1996).
Author information
Authors and Affiliations
Contributions
C.C.F. and P.B. performed experimental analysis, R.M.V., N.B. and C.O.B. provided materials, C.C.F. carried out data analysis, C.C.F. and M.M.M. wrote the manuscript and M.M.M. and R.M.V. provided funding. All authors critically reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Camprubí-Font, C., Bustamante, P., Vidal, R.M. et al. Study of a classification algorithm for AIEC identification in geographically distinct E. coli strains. Sci Rep 10, 8094 (2020). https://doi.org/10.1038/s41598-020-64894-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-020-64894-5
- Springer Nature Limited