Abstract
Background
COVID-19 is a complex, multi-system disease with varying severity and symptoms. Identifying changes in critically ill COVID-19 patients’ proteomes enables a better understanding of markers associated with susceptibility, symptoms, and treatment. We performed plasma antibody microarray and machine learning analyses to identify novel proteins of COVID-19.
Methods
A case-control study comparing the concentration of 2000 plasma proteins in age- and sex-matched COVID-19 inpatients, non-COVID-19 sepsis controls, and healthy control subjects. Machine learning was used to identify a unique proteome signature in COVID-19 patients. Protein expression was correlated with clinically relevant variables and analyzed for temporal changes over hospitalization days 1, 3, 7, and 10. Expert-curated protein expression information was analyzed with Natural language processing (NLP) to determine organ- and cell-specific expression.
Results
Machine learning identified a 28-protein model that accurately differentiated COVID-19 patients from ICU non-COVID-19 patients (accuracy = 0.89, AUC = 1.00, F1 = 0.89) and healthy controls (accuracy = 0.89, AUC = 1.00, F1 = 0.88). An optimal nine-protein model (PF4V1, NUCB1, CrkL, SerpinD1, Fen1, GATA-4, ProSAAS, PARK7, and NET1) maintained high classification ability. Specific proteins correlated with hemoglobin, coagulation factors, hypertension, and high-flow nasal cannula intervention (P < 0.01). Time-course analysis of the 28 leading proteins demonstrated no significant temporal changes within the COVID-19 cohort. NLP analysis identified multi-system expression of the key proteins, with the digestive and nervous systems being the leading systems.
Conclusions
The plasma proteome of critically ill COVID-19 patients was distinguishable from that of non-COVID-19 sepsis controls and healthy control subjects. The leading 28 proteins and their subset of 9 proteins yielded accurate classification models and are expressed in multiple organ systems. The identified COVID-19 proteomic signature helps elucidate COVID-19 pathophysiology and may guide future COVID-19 treatment development.
Similar content being viewed by others
Introduction
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) induces coronavirus disease 2019 (COVID-19), a pandemic disease affecting more than 750 million individuals with over 6.8 million deaths [1, 2]. COVID-19 vaccinations and alternative variants influence the incidence and severity of new COVID-19 cases [3,4,5]; consequently, an improved understanding of the disease is necessary to counteract possible vaccine breakthroughs [6, 7]. Individuals with COVID-19 present with heterogeneous symptoms and severity due to the complex, multi-system pathophysiological impact of the SARS-CoV-2 virus [8,9,10,11]. COVID-19 severity is also complicated by various demographic and clinical risk factors, including age, sex, and pre-existing comorbidities [12,13,14,15].
SARS-CoV-2 infection triggers an innate immune response characterized by elevations in plasma pro-inflammatory cytokines, proteases and related proteins [16,17,18,19,20,21,22]. Vascular injury and endothelial dysregulation are key components of COVID-19, often resulting in microvascular thrombosis [23,24,25,26]. A humoral immune response follows the innate reaction, with robust production of SARS-CoV-2-specific antibodies [27,28,29]. In critically ill patients, COVID-19 results in impaired immune cell homing and programmed cell death. Specifically, antigen presentation and B/T-cell function is reduced, neutrophils and M1-type macrophages are repurposed, endothelia and fibroblasts are disrupted, myeloid lines become reactive, and the extracellular matrix is altered [30]. Despite a wealth of knowledge on COVID-19 pathophysiology, a unique proteomic signature that (1) includes proteins expressed across multiple systems and (2) that can be used to identify novel connected pathways, remains elusive.
This study aims to identify proteins specific to critically ill COVID-19 patients relative to age- and sex-matched non-COVID-19 sepsis patients and healthy control subjects. Our specific objectives were: (1) to measure the concentrations of 2,000 plasma proteins with antibody microarrays from the three cohorts; (2) to determine the relative importance of the plasma proteins in identifying COVID-19 patients to develop classification models; (3) to correlate the leading proteins to clinically relevant variables; (4) to investigate expression changes in the leading proteins on hospitalization days 1, 3, 7, and 10; and (5) to determine the cell type and organ system expression patterns of the leading proteins.
Methods
Study participants, blood sampling, and Cohort Matching
We used the Sepsis 3.0 criteria, which does not require pathogen identification, to screen patients admitted to our intensive care unit (ICU) [31]. All COVID-19 participants were pre-vaccinated. Two SARS-CoV-2 viral genes (RdRP and E) were detected using a polymerase chain reaction to confirm or refute COVID-19 status [32]. Blood was drawn on ICU days 1, 3, 7, and 10 for COVID-19 patients and on ICU days 1 and 3 for non-COVID-19 patients, depending on their continued admission in the ICU. Blood was obtained via indwelling catheters, and if a venipuncture was required, research blood draws were coordinated with a clinically indicated blood draw. In keeping with accepted research phlebotomy protocols for adult patients, blood draws did not exceed maximal volumes [33]. Blood was centrifuged, the plasma isolated and aliquoted at 250 µL in cryovials, and frozen at − 80 °C. All samples remained frozen until use, and freeze/thaw cycles were avoided. The healthy control subjects were individuals without disease, acute illness, or prescription medications and whose samples were collected prior to the emergence of SARS-CoV-2 (Translational Research Centre, London, ON; Directed by Dr. D.D. Fraser) [34, 35]. Final participant groups were constructed by age- and sex-matching ICU COVID-19 patients with ICU non-COVID-19 sepsis controls and healthy control subjects, resulting in 15 participants per group.
Patient demographics and Clinical Data
Baseline characteristics for COVID-19 and non-COVID-19 sepsis controls on ICU admission Day 1 were recorded, including age, sex, comorbidities, standard hospital laboratory measurements, PaO2 to FiO2 ratio, and chest radiograph findings. Also, the Multiple Organ Dysfunction Score (MODS) and Sequential Organ Failure Assessment Score (SOFA) were calculated [31, 36]. Clinical interventions received during the observation period were also recorded, including the use of antibiotics, antiviral agents, systemic corticosteroids, vasoactive medications, antiplatelet treatment, anticoagulation treatment, renal replacement therapy, high-flow oxygen therapy, and mechanical ventilation (both invasive and non-invasive). For healthy controls, only age and sex were available.
Antibody microarray
The RayBio® L-Series Human Antibody Array 2000 kit (RayBiotech Life Inc., GA, USA) was used to measure 2,000 proteins in plasma obtained from age- and sex-matched COVID-19 and non-COVID-19 sepsis patients, as well as healthy control subjects. The kits detect a broad range of proteins, including, but not limited to, cytokines, growth factors, receptors, signalling proteins, metabolic enzymes, and epigenetic markers. Prior to sample analysis, the stored plasma cryotubes underwent visual inspection to ensure that they were sealed and intact. The plasma samples were then thawed, visually inspected, and centrifuged to remove low molecular weight amine derivaties and unwanted buffer (plasma samples were only used if free of obvious contamination, hemolysis, precipitate, and lipemia). A labelling reagent was applied to biotinylate the purified plasma. Before application of plasma, each lot of the antibody array slides was tested with a positive control to verify accuracy within a predefined range (e.g., CV%). A blocking buffer was then applied to the microarray glass slides, followed by the biotinylated plasma samples at a 20x dilution. A streptavidin-conjugated fluorescent dye (CY3-Equivelent) was applied, and protein expression was measured via laser fluorescence scanning. There were 4 arrays (“y”) that contained 493–507 protein targets (“X”; spots), for a total of 2000 protein targets. Of the 4 arrays, one was arbitrarily selected to be the “reference array”, to which all the other arrays were normalized. Background level subtraction was performed by measuring the local background around each spot and subtracting that from the measured spot fluorescence. Fluorescence levels were normalized as follows: X(Ny) = X(y) * P1/P(y); where: P1 = mean signal intensity of Positive Controls on reference array, P(y) = mean signal intensity of Positive Controls on Array “y”; X(y) = mean signal intensity for spot “X” on Array “y”; and X(Ny) = normalized signal intensity for spot “X” on Array “y”. The data used for our analysis was the normalized fluorescence intensity signals, which is an arbitrary unit refered to as RFI (relative fluorescence intensity).
Conventional statistics
Patient baseline clinical characteristics (Day 1 of ICU admission) were reported as median (IQR) for continuous variables and frequency (%) for categorical variables. A Kruskal-Wallis comparison of the individual proteins was conducted between the healthy controls, ICU non-COVID-19 patients, and ICU COVID-19 patients on Day 1, followed by pairwise comparison with a post-hoc Dunn test that included a false discovery rate (Benjamini-Hochberg) correction. A paired comparison of protein expression on multiple days was conducted using the Wilcoxon Signed-Rank test with Bonferroni correction to assess changes during the ICU stay. Only multiple comparison-corrected P-values are reported, and those below 0.05 were considered statistically significant.
Machine learning
The data was split into a feature selection dataset (70%) and a testing dataset (30%), stratified by subject groups (Supplemental Fig. 1). The feature selection was done on Day 1 data that combined healthy controls and ICU non-COVID-19 patients and compared them against ICU COVID-19 patients. The combined cohort ensures the selection of the most significant proteins relevant to both healthy controls and ICU non-COVID-19 patients, which may improve clinical translation. The Boruta feature selection algorithm, based on Random Forest classifiers, was used to identify the most important proteins [37]. It individually compares each protein to randomly arranged versions of the data to determine if the protein is better at classifying than chance. The results from the Boruta feature reduction identified the most relevant proteins for classifying COVID-19 (a “reduced protein signature”). To assess the classification ability, two separate Random Forest classifiers were created to assess healthy controls versus ICU COVID-19 patients and ICU non-COVID-19 patients versus ICU COVID-19 patients.
Steps were undertaken to conduct a conservative analysis that mitigates small sample sizes and overfitting concerns. The Boruta algorithm was run on the feature reduction dataset to determine the most relevant features. The testing dataset was modified to contain only the identified relevant features. The reduced testing dataset was then used for the classification of COVID-19 with a Random Forest classifier. To reduce overfitting and maintain a conservative model, three-fold cross-validation was used with a Random Forest of 10 trees and a maximum depth of 3 [38]. The accuracy, receiver operating characteristic (ROC) curve area-under-curve (AUC), and the F1 score are reported. A high F1 score indicated that precision and recall are high.
As a Random Forest is a set of decision trees, we were able to interrogate this collection of trees to identify the features that have the highest predictive value (viz., those features that frequently appear near the top of the decision tree). Based on this characteristic, recursive feature elimination (RFE) was used to prepare an optimal model. RFE started with the reduced training dataset, fitted a Random Forest classifier, dropped the least important feature, and repeated the process until only ten features remained. Due to the randomness of the algorithm and Random Forest models, 10,000 runs of RFE were conducted. Those features in the top 10 for more than a specified threshold of the 10,000 runs were determined to be the optimal features. An optimal testing dataset containing only these optimal features was generated from the reduced testing dataset. The same classification process for the reduced testing dataset was used on the optimal testing dataset.
The proteomic data was visualized with a nonlinear dimensionality reduction on the reduced and optimal datasets using the t-distributed stochastic nearest neighbour embedding (t-SNE) algorithm. A t-SNE assumes that the ‘optimal’ representation of the data lies on a manifold with complex geometry, but in a low dimension, embedded in the full-dimensional space of the raw data [39]. Seperate t-SNE plots were constructed with all participants on the complete, reduced, and optimal datasets for visual comparison of clustering patterns. A pairwise comparison, using cosine similarity, was conducted to determine the similarity between subjects across the selected proteins and time points [40]. As such, subjects similar across their selected proteomic profile have a score closer to 1, while dissimilar subjects have a score closer to 0. The analysis was done with data Min-Max scaled between 0 and 1, and the cosine similarities were visualized using a heatmap.
The indivudal protein performance for distinguishing ICU COVID-19 patients from healthy controls and ICU non-COVID-19 patients was compared using a bootstrap Logistic Regression approach with 1000 repetitions. The participants were sampled with replacement, and three-fold cross-validation was used. The mean ROC AUC, sensitivity, specificity, and the F1 score are reported. The machine learning analysis was conducted using Python version 3.10.11 and Scikit-Learn version 1.2.2 [41].
Natural Language Processing
In order to identify physiological domains of interest in COVID-19 patients, exploratory expression analysis was conducted with Natural Language Processing (NLP). Expertly curated mRNA/protein expression information was parsed from the Uniprot Knowledgebase as unstructured text with UniProt’s REST API [42]. An NLP named-entity recognition (NER) pipeline was configured with the MIMIC package for preprocessing, negation detection, and the pretrained Stanza BioNLP13CG Biomedical model (Python v. 3.10.11; spaCy v. 3.3.1; spaCy-Stanza v. 1.0.2; negspaCy v. 1.0.3) [43,44,45]. The negation detection was done using the NegEx-based negspaCy implementation with a modified English clinical term set to filter negative expression terms. Although the BioNLP13CG biomedical model was based on Cancer Genetics and publicly available PubMed abstracts, compared to the other Stanza models, it provided the most granular entity classification, including anatomical system, organ, tissue, multi-level tissue, and cell type entities. The detected organ and cell type entities were manually classified into keyword-based groups separately. The manual expression curation process relies on existing literature and is not easily structured into specific organ systems. The organ, tissue, multi-tissue, and anatomical system entity types were combined and manually sorted into organ systems to include the maximum expression information in the analysis. The frequency of the keyword-based categories with respect to the relevant proteins was determined to identify physiological patterns of expression.
Results
A total of three age- and sex-matched groups were included, consisting of COVID-19 patients (median years old = 60; IQR = 12; n = 15), non-COVID-19 sepsis patients (median years old = 57; IQR = 11; n = 15), and healthy control subjects (median years old = 56; IQR = 10; n = 15). There were no significant differences in age (Kruskal-Wallis H-test, P = 0.87) and sex (Chi-Square, P = 1.000) between the three cohorts. Baseline demographic characteristics, comorbidities, laboratory measurements, interventions, and chest x-ray findings of COVID-19 and non-COVID-19 sepsis controls are reported in Table 1. The two cohorts were generally similar in terms of their demographics, comorbidities, and interventions, except that COVID-19 patients had longer intubation periods and greater ICU days. While all ICU patients met the Sepsis 3.0 presentation criteria, only 40% of the non-COVID-19 ICU patients had pathogen identified. The COVID-19 patients were more likely to have bilateral pneumonia, lower white blood cell and lymphocyte counts, higher INR and PTT, and a lower PaO2/FiO2 ratio.
The expression levels of 2,000 proteins (1,968 unique proteins) were measured (Supplemental Fig. 2), and the cohorts plotted with t-SNE (all 2,000 proteins; Supplemental Fig. 3). Using Boruta feature selection machine learning, the leading 28 proteins were identified in comparing ICU COVID-19 patients to ICU non-COVID-19 patients (ICU day 1 for both) and healthy control subjects, and their relative importance is provided in Table 2. The leading 28 protein model had high classification ability when comparing ICU COVID-19 patients to ICU non-COVID-19 patients (accuracy = 0.89, AUC = 1.00, F1 = 0.89) as well as when comparing ICU COVID-19 patients to healthy control subjects (accuracy = 0.89, AUC = 1.00, F1 = 0.88). Individually, each of the 28 proteins was significantly different in COVID-19 patients compared to non-COVID-19 patients (FDR adjusted P < 0.05). When compared to healthy controls, 4 proteins out of the 28 (Galanin, ProSAAS, VimentinB, and NET1) were not significantly different from COVID-19 patients. Of the 28 proteins, only four had overall elevated levels in the COVID-19 patients (Fyn, Fen1, Azurocidin, and NET1; Supplemental Fig. 4). The individual classification abilities varied between the 28 proteins and is provided in Supplemental Tables 1 and Supplemental Table 2. Visualizing the 28 protein classification ability on Day 1 using t-SNE plots demonstrated a distinct COVID-19 patient cluster separation (one outlier) from healthy controls (Fig. 1A) as well as non-COVID-19 patients (Fig. 1C). The functions of the 28 proteins are described in Supplemental Table 3.
Recursive feature elimination was used to determine a set of optimal proteins. Those proteins in the top 10 for at least 5,000 of the 10,000 RFE repetitions (50%) were selected as the optimal protein model. Nine of the 28 proteins were optimal: PF4V1, NUCB1, CrkL, SerpinD1, Fen1, GATA-4, ProSAAS, PARK7, and NET1 (Supplemental Fig. 5). The optimal set of proteins maintained a high classification ability between COVID-19 patients on Day 1 and healthy controls (accuracy = 0.97, ROC = 0.97, F1 = 0.96) as well as between COVID-19 and non-COVID-19 on Day 1 (accuracy = 0.93, ROC = 1.00, F1 = 0.92). All proteins were significantly different in COVID-19 patients from non-COVID-19 patients, while ProSAAS and NET1 were not significantly different between COVID-19 and healthy controls (FDR-adjusted P < 0.05). Of the 9 proteins, only Fen1 and NET1 were elevated in COVID-19 patients. Visually, the t-SNE plots based on the nine optimal proteins illustrate a separation between the COVID-19 patients and healthy controls, with two outliers (Fig. 1B) as well as a distinct separation between COVID-19 patients and ICU non-COVID-19 patients (Fig. 1D).
Pairwise cosine similarity between all subjects and available time points was calculated to compare the cohorts in terms of their reduced and optimal protein profiles, presented in Fig. 1E and F, respectively. The healthy control subjects have the most homogenous protein profiles in both the 28 and 9 protein models. The non-COVID-19 sepsis controls were relatively homogenous across ICU Days 1 and 3, with observable differences from healthy control subjects. The COVID-19 patients are distinct at all time points from the other cohorts. Compared to the 28 protein profile, the COVID-19 patients are more homogenous across time points with the 9 protein profile. The expression of the leading proteins in COVID-19 patients on ICU Days 3, 7, and 10 were compared to their ICU Day 1 expression and demonstrated no significant differences over time (P > 0.05; data not shown).
The relevant leading 28 protein measurements of the COVID-19 patients were compared to their clinical variables. A total of seven significant associations (P < 0.01) were identified and are presented in Figs. 2 and 3. Fibronectin levels in all COVID-19 patients were below healthy control subjects and demonstrated a negative correlation with hemoglobin (Fig. 2A). Most COVID-19 patients’ PCTM1 measurements were below healthy control subjects and negatively correlated with INR (Fig. 2B). SerpinB5, ERRa, and IGFBP-5 in COVID-19 patients were all positively correlated with PTT, and most patients had measurement levels below healthy control subjects (Fig. 2C-E). MammaglobinA was lower in COVID-19 patients who received high-flow nasal cannula intervention (Fig. 3A). ProSAAS was lower in patients with hypertension comorbidity (Fig. 3B).
Named-entity recognition was conducted on the tissue expression information provided by the UniProt Knowledgebase. Out of the 28 leading proteins, 14 (50%) had organ expression information (Supplemental Table 4), and 8 (29%) had cell type expression information (Supplemental Table 5). The percentage of the 14 proteins expressed in specific organ systems, led by the digestive and nervous systems, is shown in Fig. 4. The percentage of the eight proteins expressed in specific cell types is shown in Supplemental Fig. 6.
Discussion
In this study, we measured the expression of 2,000 plasma proteins with antibody micro-array technology from age- and sex-matched COVID-19 patients, non-COVID-19 sepsis controls, and healthy control subjects. Using machine learning-based protein subset identification, we identified a 28-protein model that accurately differentiated COVID-19 patients from their comparison cohorts. Furthermore, we determined an optimal 9-protein subset model that maintained high classification ability. Some identified proteins were associated with clinical and demographic characteristics in the COVID-19 patients. NLP of expert-curated expression information identified multi-system expression of the leading proteins. This study has identified a reduced protein signature for COVID-19 patients that contributes to COVID-19 pathophysiology characterization and may inform the development of therapeutic interventions upon further investigation.
Our critically ill COVID-19 cohort was similar to other reported cohorts, with only minor differences [8, 46,47,48,49]. For example, the mortality rate in our COVID-19 patients was higher than reported by other studies and may suggest a greater illness burden in our patients [8, 46, 50]. The platelet count in our COVID-19 patients was lower than reported in the literature [51,52,53], perhaps reflecting greater microvascular injury and overall microclot risk [23]. Similarly, the PaO2/FiO2 ratio was also lower in our COVID-19 patients [8, 53], indicating higher levels of acute lung injury. Although COVID-19 lymphocyte counts, INR, and bilateral pulmonary complications were significantly different than in non-COVID-19 sepsis controls, they were similar to those in COVID-19 patients reported in the literature [49, 52, 53].
A unique 28-protein signature that differentiated COVID-19 patients from non-COVID-19 sepsis controls and healthy control subjects was determined. Each of the identified proteins was individually different in the COVID-19 cohort from the non-COVID-19 cohort, as well as 24 proteins were different in the healthy control subjects. Many proteins had high individual distinguishing power, further positioning them as possible disease biomarkers. Those proteins with weaker individual performance may be beneficial in a combination or secondary role. Time-based analysis and inspection of the pairwise subject comparison demonstrated no changes in COVID-19 protein expression over multiple ICU days and interventions, suggesting that the reduced protein signature is robust, reproducible, and remains highly predictive of COVID-19 disease status over 10 hospitalization days. In addition, an optimal model consisting of 9 proteins (PF4V1, NUCB1, CrkL, SerpinD1, Fen1, GATA-4, ProSAAS, PARK7, and NET1) maintained the high classification ability found in the superset 28-protein model. The pairwise comparison analysis suggests that the nine-protein model may be more consistent across multiple days than the 28-protein model.
Correlation analysis comparing the expression of the 28-protein in COVID-19 patients with their respective clinical characteristics identified seven associations. Interestingly, five proteins correlated with measures of blood clotting, including the INR and PTT. The COVID-19 patients had significantly higher INR and PTT measurements compared to non-COVID-19 sepsis controls; however, the measurements were within the normal clinical range. Almost all patients across the two ICU cohorts had anticoagulation interventions. PCMT1 was negatively correlated with INR in COVID-19 patients but not linked to thrombosis in the literature. SerpinB5, ERRa, and IGFBP-5 measurements in COVID-19 patients were mainly lower than healthy controls and exhibited a positive correlation with PTT; however, similar to PCMT1, none of the correlated proteins have been linked to thrombosis previously. Hemoglobin was negatively correlated with fibronectin in COVID-19 patients, with all patients having fibronectin levels lower than healthy controls. MammaglobinA, a secreted glycosylated proteins involved in cell signalling and the immune response, differentiated COVID-19 patients who received high-flow nasal cannula oxygen therapy as an intervention [54, 55]. Lastly, ProSAAS, a neuroendocrine hormone, was lower in those patients with pre-existing hypertension [56].
Serpins are a family of protease inhibitors that use conformational changes to inhibit target enzymes [57]. Four of the 28 proteins that changed in COVID-19 were Serpins (A1, D1, A4, and A12), and all were downregulated. In line with a previous study, SerpinA1 was downregulated in our COVID-19 cohort [58]. SerpinA1 is proposed to limit SARS-CoV-2 cell entry via inhibition of cell surface transmembrane protease 2 (TMPRSS2) function, a critical step in the required processing of the SARS-CoV-2 spike protein [59]. In addition, SerpinA1 was associated with decreased COVID-19 severity [60, 61], and suggested as a potential COVID-19 treatment. Indeed, COVID-19 patients with moderate to severe acute respiratory distress syndrome improved in a phase 2 randomized control trial after SerpinA1 intervention [62]. Administration of SerpinA1 is also suggested as a therapy for alpha-1-antitrypsin deficiency (AATD), in which there is an increased risk of emphysema, obstructive lung disease, and liver disease [63,64,65,66,67,68]; however, it is unclear if AATD mutations are associated with COVID-19 severity [61, 69, 70]. SerpinD1, a thrombosis inhibitor [71], competes with the SARS-CoV-2 spike protein to bind heparin, resulting in increased thrombosis risk [72]. The regulation of SerpinD1 in COVID-19 is controversial, as a study has shown that SerpinD1 was higher in moderate and severe cases [73]. SerpinA4, also known as kallistatin, exerts multiple effects on inflammation, angiogenesis, and tumor growth. A single nucleotide polymorphism in the SerpinA4 gene was linked to acute kidney injury in COVID-19 patients [74]. Down-regulation of SerpinA4 was noted in COVID-19 non-survivors, indicating a persistent pro-inflammatory signature [75]. SerpinA12 is an adipokine that has been linked to the development of insulin resistance, obesity, and inflammation [76]. In COVID-19, the downregulation of SerpinA12 may heighten inflammation via the kallikrein–kinin system [77].
NLP analysis processed expert-curated expression information from the UniProt Knowledgebase to identify organ- and cell-specific proteins. Of the 28 proteins, 14 (50%) had organ system expression information, with most proteins linked to expression in the digestive and nervous systems. NLP cell-type analysis results were inconclusive, as only eight proteins had cell-type expression information.
Gastrointestinal system complications are prevalent in COVID-19 patients, including diarrhea, nausea/vomiting, and abdominal pain [9, 78, 79]. Fen1, involved in critical DNA synthesis and repair mechanisms, was overexpressed in our COVID-19 cohort. Fen1 is reported to be involved in hepatocellular and gastrointestinal cancers [80, 81], and a novel antiviral strategy that utilizes FEN1 to decrease SARS-CoV-2 cellular functions has been proposed [82]. The expression of both CrkL and fibronectin was decreased in our COVID-19 cohort. The former, which is associated with gastrointestinal cancers, has been suggested as a potential COVID-19 drug target [83,84,85]. The latter is a widely expressed extracellular matrix protein associated with liver regeneration, fibrogenesis, and intestinal inflammation [86,87,88].
Nervous system symptoms in COVID-19 patients are prevalent, with COVID-19 severity being associated with increased neurological complications [89,90,91]. Our NLP analysis identified proteins, mainly down-regulated, from our COVID-19 cohort that are linked to the nervous system. SHANK1, downregulated in COVID-19 patients, facilitates protein-protein interactions in excitatory synapses [92], and its downregulation may hinder neuronal communication [93]. Our COVID-19 patients had decreased expression of PCMT1, a carboxyl methyltransferase. PCMT1 downregulation is linked to neurodegenerative diseases and may increase ß-amyloid production [94, 95]. PARK7 is decreased in our COVID-19 patients and may not effectively perform its protective role against neurotoxicity and neuronal viability [96,97,98]. PARK7 performs various cellular functions, including acting as a chaperone, interacting with transcription factors, and being involved in anti-oxidative properties under oxidative stress conditions [99,100,101]. PARK7 is a critical protein involved in the gut-brain axis and related to altered gut microbiomes [102, 103]. Nucleobinding 1 (NUCB1) is widely expressed in brain neurons and stabilizes amyloid protofibrils before they mature and become harmful in neurodegenerative diseases [104, 105]; however, its downregulation in our COVID-19 patients suggests decreased neurological protective mechanisms. Presenilin2 is a crucial protein in neurodegenerative disease and was decreased in our COVID-19 patients. Presenilin2 is responsible for the cleaving enzymatic action required to form amyloid plaques and also forms Ca2+ leak channels that support the calcium hypothesis of AD [106,107,108,109]. Similar to Presenillin2, ProSAAS, an amyloid anti-aggregant in Alzheimer’s disease, is decreased in our COVID-19 patients [110]. ProSAAS is a neuroendocrine chaperone protein with protective effects against neurodegeneration, such that increased endocrine and neurological cell stressors are associated with elevated expression [111, 112]. Galanin was downregulated in our COVID-19 patients and operates on the neuroendocrine axis with various functions throughout the central and peripheral nervous and endocrine systems [113]. Fyn, elevated in our COVID-19 cohort, has a harmful role in neurological diseases and may be a potential target for neurodegenerative disease due to its ß-amyloid signalling and tau interactions [114,115,116].
NLP analysis also identified the endocrine system as potentially impacted due to differential protein expression. COVID-19 patients with hypertension had significantly lower expression of ProSAAS, which may be related to ProSAAS peptides involved in salt sensitivity [117]. Diabetes diagnosis and insulin sensitivity have been linked to COVID-19 severity and mortality [118,119,120], and downregulated ERRa in our COVID-19 cohort is linked to insulin resistance, diabetes, and obesity [121,122,123,124]. ERRa regulates glycolysis and lipid metabolism in multiple organs, along with steroidogenesis in the adrenal cortex [125,126,127]. Similar to our cohort, lower IGFBP-5 expression was previously observed in COVID-19 patients [128], and IGFBPs are linked to diabetes and metabolic disorders [129,130,131,132,133]. SerpinA12 was down-regulated in our COVID-19 patients and is associated with diabetes and obesity due to its insulin-sensitizing effects [134,135,136,137,138]. The downregulated NUCB1 in our COVID-19 patients suggested a harmful effect related to type 2 diabetes as it performs amyloid stabilization in human islet cells to prevent fibrils in the pancreas that impact type 2 diabetes [104, 139, 140]. The decreased PARK7 in COVID-19 patients could also be connected to a metabolic imbalance. PARK7 protects pancreatic beta-cells from oxidative stress conditions, and its deficiency is associated with decreased inflammatory and adipogenesis responses [141,142,143] and type 2 diabetes [144, 145]. Lastly, Presenilin 2 is expressed in endocrine cells, but there is insufficient data on its role and association with diabetes [146, 147].
COVID-19 is linked with various cardiovascular changes, including vascular transformation, thrombosis, and angiogenesis [148,149,150,151,152,153]. NLP analysis revealed proteins expressed in the cardiovascular system. GATA-4 is involved in cardiac remodelling, differentiation, and signalling by acting as a cardiogenic transcription factor [154,155,156]. GATA-4 was reduced in our COVID-19 patients, indicating that subsequent remodelling pathways may be impaired. IGFBP-5 expression was reduced in COVID-19 patients [128], and it is an inhibitor of angiogenesis and vascular smooth muscle cell proliferation [157,158,159]. PF4V1, decreased in our COVID-19 patients, is an angiogenesis inhibitor and may also regulate inflammation and thrombosis [160,161,162,163]. SerpinA4 (Kallistatin) was lower in our COVID-19 patients [164], and it protects against vascular oxidative stress and inflammation as well as inhibiting angiogenesis [165,166,167]. Thus, the decreased expression of IGFBP-5, PF4V1, and SeprinA4 in COVID-19 may be cardioprotective, perhaps via suppression of angiogenesis and vascular transformation. EphB4, also associated with angiogenesis, was downregulated in our COVID-19 patients [168,169,170,171].
The novelties of this study include the proteins identified, the immune microarray platform utilized, and several of the analytic techniques. Previous proteomics studies have also identified molecular models that differentiate COVID-19 patients from non-COVID-19 sepsis controls and healthy control participants [172,173,174,175]. While these studies identify a number of important molecules, they did not evaluate their effectiveness in a single combined model, which decreases the likelihood of cross-identity concerns with other diseases. The novel proteins identified in our study may be attributed to our use of an immune microarray platform, while other studies utilized mass spectrometry or proximity extension assays [172,173,174,175,176,177]. Pathway analysis was used in previous studies to help understand COVID-19 pathophysiology [174, 176, 177]; however, our approach utilized NLP to identify organ and cell expression patterns.
In this study, we identified a novel 28-protein signature and an optimal 9-protein signature that accurately classifies COVID-19 patients from non-COVID-19 sepsis controls and healthy control subjects; however, our study has several limitations. First, the number of subjects in each comparison group was limited to 15, which impacted the choice of analytic. Conservative methods were used to avoid common overfitting or non-generalizable results. Conventional statistics consisted of only non-parametric methods with strict multiple comparison correction. Machine learning classification utilized cross-validation with conservative parameters and without any hyperparameter tuning. Also, protein model building and testing consisted of separate data subsets to reduce overfitting. Second, not all identified proteins had UniProt Knowledgebase-curated expression information, leaving the potential for unrecognized patterns in organ and cell system expression. Similarly, there is a possibility for missed organ/cell identification with NLP; however, preprocessing of expression information was carefully done, and NER used a state-of-the-art biomedical model. Third, static protein measurements must be interpreted with caution as they do not always correlate with functional changes. As one example, Serpins undergo a conformational change to elicit biological effects and therefore require further functional analyses. Lastly, we only compared the COVID-19 proteome signatures to other cohorts, but there may be cross-identity concerns with other illnesses. The use of multiple proteins would reduce this latter limitation. Although our exploratory study had these minor constraints, the data provided insight into the pathophysiological changes in COVID-19 patients.
Conclusion
Our understanding of COVID-19 pathophysiology, especially in critically ill patients, is incomplete due to its multi-system complications. We identified 28 proteins that accurately differentiate COVID-19 ICU patients from non-COVID-19 sepsis ICU controls and healthy control subjects. The leading proteins are expressed in multiple organ systems and are associated with various diseases and pathophysiological functions, including diabetes, neurodegeneration, metabolic processes, and vascular transformation. The results of our proteomic exploratory study offer insightful information about COVID-19 and might aid in the development of future treatments.
Data availability
The datasets generated and/or analysed during the current study are available from the corresponding author on reasonable request.
Abbreviations
- AUC:
-
Area-under-the-curve ROC
- COVID-19:
-
Coronavirus Disease 2019
- FiO2 :
-
Fractional Inspired Oxygen
- ICU:
-
Intensive Care Unit
- INR:
-
International Normalized Ratio
- MODS:
-
Multiple Organ Dysfunction Score
- NER:
-
Named Entity Recognition
- NLP:
-
Natural Language Processing
- PaO2 :
-
Partial Pressure of Oxygen
- PTT:
-
Partial Thromboplastin Time
- RFE:
-
Recursive Feature Elimination
- ROC:
-
Receiver Operating Characteristic Curves
- SARS:
-
Severe Acute Respiratory Syndrome
- SARS-CoV-2:
-
Severe Acute Respiratory Syndrome Coronavirus 2
- SOFA:
-
Sequential Organ Failure Assessment Score
- t-SNE:
-
T-distributed Stochastic Nearest Neighbor Embedding algorithm
References
Harrison AG, Lin T, Wang P. Mechanisms of SARS-CoV-2 transmission and Pathogenesis. Trends Immunol. 2020;41:1100–15.
WHO COVID-19. Dashboard [Internet]. June 01 2023. Geneva: World Health Organization; [cited June 01 2023]. https://covid19.who.int/
Modes ME, et al. Clinical characteristics and outcomes among adults hospitalized with laboratory-confirmed SARS-CoV-2 infection during periods of B.1.617.2 (Delta) and B.1.1.529 (omicron) variant predominance - one hospital, California, July 15-September 23, 2021, and December 21, 2021-January 27, 2022. MMWR Morb Mortal Wkly Rep. 2022;71:217–23.
Rzymski P, Kasianchuk N, Sikora D, Poniedziałek B. COVID-19 vaccinations and rates of infections, hospitalizations, ICU admissions, and deaths in Europe during SARS-CoV-2 Omicron wave in the first quarter of 2022. J Med Virol. 2023;95:e28131.
Mhawish H, et al. Comparison of severity of immunized versus non-immunized COVID-19 patients admitted to ICU: a prospective observational study. Annals Med Surg. 2021;71:102951.
Moghadas SM, et al. The impact of vaccination on Coronavirus Disease 2019 (COVID-19) outbreaks in the United States. Clin Infect Dis. 2021;73:2257–64.
Covid C, et al. COVID-19 vaccine breakthrough infections reported to CDC—United States, January 1–April 30, 2021. Morb Mortal Wkly Rep. 2021;70:792.
Grasselli G et al. Baseline Characteristics and Outcomes of 1591 Patients Infected With SARS-CoV-2 Admitted to ICUs of the Lombardy Region, Italy. JAMA. 2020.
Kartsonaki C, et al. Characteristics and outcomes of an international cohort of 600 000 hospitalized patients with COVID-19. Int J Epidemiol. 2023;52:355–76.
Wu Z, McGoogan JM. Characteristics of and important lessons from the coronavirus disease 2019 (COVID-19) outbreak in China: summary of a report of 72 314 cases from the Chinese Center for Disease Control and Prevention. JAMA. 2020;323:1239–42.
Hu B, Guo H, Zhou P, Shi Z-L. Characteristics of SARS-CoV-2 and COVID-19. Nat Rev Microbiol. 2021;19:141–54.
Zhou Y, et al. Comorbidities and the risk of severe or fatal outcomes associated with coronavirus disease 2019: a systematic review and meta-analysis. Int J Infect Dis. 2020;99:47–56.
Jain V, Yuan J-M. Predictive symptoms and comorbidities for severe COVID-19 and intensive care unit admission: a systematic review and meta-analysis. Int J Public Health. 2020;65:533–46.
Ahlström B, et al. The Swedish covid-19 intensive care cohort: risk factors of ICU admission and ICU mortality. Acta Anaesthesiol Scand. 2021;65:525–33.
Bart GP, et al. Demographic risk factors for COVID-19 infection, severity, ICU admission and death: a meta-analysis of 59 studies. BMJ Open. 2021;11:e044640.
Del Valle DM, et al. An inflammatory cytokine signature predicts COVID-19 severity and survival. Nat Med. 2020;26:1636–43.
Han H, et al. Profiling serum cytokines in COVID-19 patients reveals IL-6 and IL-10 are disease severity predictors. Emerg Microbes Infections. 2020;9:1123–30.
Fara A, Mitrev Z, Rosalia RA, Assas BM. Cytokine storm and COVID-19: a chronicle of pro-inflammatory cytokines. Open Biology. 2020;10:200160.
Yang L, et al. COVID-19: immunopathogenesis and immunotherapeutics. Signal Transduct Target Therapy. 2020;5:128.
Mahmudpour M, Roozbeh J, Keshavarz M, Farrokhi S, Nabipour I. COVID-19 cytokine storm: the anger of inflammation. Cytokine. 2020;133:155151.
Fraser DD, et al. Novel outcome biomarkers identified with targeted proteomic analyses of plasma from critically ill coronavirus Disease 2019 patients. Crit Care Explor. 2020;2:e0189.
Fraser DD, et al. Inflammation profiling of critically ill coronavirus Disease 2019 patients. Crit Care Explor. 2020;2:e0144.
Fraser DD, et al. Endothelial Injury and Glycocalyx Degradation in critically ill coronavirus Disease 2019 patients: implications for microvascular platelet aggregation. Crit care Explorations. 2020;2:e0194–0194.
Gorog DA, et al. Current and novel biomarkers of thrombotic risk in COVID-19: a Consensus Statement from the International COVID-19 thrombosis biomarkers Colloquium. Nat Reviews Cardiol. 2022;19:475–95.
Yao Y, et al. D-dimer as a biomarker for disease severity and mortality in COVID-19 patients: a case control study. J Intensive Care. 2020;8:49.
Cabrera-Garcia D, et al. Plasma biomarkers associated with survival and thrombosis in hospitalized COVID-19 patients. Int J Hematol. 2022;116:937–46.
Fraser DD, et al. Detection and profiling of human coronavirus immunoglobulins in critically ill coronavirus Disease 2019 patients. Crit Care Explor. 2021;3:e0369.
Fraser DD, et al. Cohort-specific serological recognition of SARS-CoV-2 variant RBD antigens. Ann Clin Lab Sci. 2022;52:651–62.
Fraser DD, et al. Critically ill COVID-19 patients exhibit Anti-SARS-CoV-2 serological responses. Pathophysiology. 2021;28:212–23.
Iosef C, et al. COVID-19 plasma proteome reveals novel temporal and cell-specific signatures for disease severity and high-precision disease management. J Cell Mol Med. 2023;27:141–57.
Singer M, et al. The Third International Consensus definitions for Sepsis and septic shock (Sepsis-3). JAMA. 2016;315:801–10.
CDC 2019-Novel Coronavirus (2019-nCoV) Real-Time RT-PCR Diagnostic Panel [Internet]. https://www.fda.gov/media/134922/download
NIH HRPP. POLICY: Guidelines for Limits of Blood Drawn for Research Purposes in the Clinical Center. M95-9 (rev.) June 5. 2009.
Brisson AR, Matsui D, Rieder MJ, Fraser DD. Translational research in pediatrics: tissue sampling and biobanking. Pediatrics. 2012;129:153–62.
Gillio-Meina C, Cepinskas G, Cecchini EL, Fraser DD. Translational research in pediatrics II: blood collection, processing, shipping, and storage. Pediatrics. 2013;131:754–66.
Priestap F, Kao R, Martin CM. External validation of a prognostic model for intensive care unit mortality: a retrospective study using the Ontario critical care information system. Can J Anaesth. 2020.
Kursa MB, Rudnicki W R. Feature Selection with the Boruta Package. Journal of Statistical Software. 2010;36(11):1–13.
Tang C, Garreau D, and von Luxburg U. When do random forests fail? In: NeurIPS. 2018;2987–97.
Van der Maaten L, Hinton G. Visualizing data using t-SNE. J Mach Learn Res. 2008;9.
Jambu M. Chap. 10 - classification of individuals–variables data sets. In: Jambu M, editor. Exploratory and Multivariate Data Analysis. Boston: Academic; 1991;305–405.
Pedregosa F, et al. Scikit-learn: machine learning in Python. J Mach Learn Res. 2011;12:2825–30.
Bateman A, et al. UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res. 2021;49:D480–9.
Zhang Y, Zhang Y, Qi P, Manning CD, Langlotz CP. Biomedical and clinical English model packages for the Stanza Python NLP library. J Am Med Inform Assoc. 2021;28:1892–9.
Qi P, Zhang Y, Zhang Y, Bolton J, Manning CD. Stanza: A Python natural language processing toolkit for many human languages. arXiv preprint arXiv:2003.07082. 2020.
Honnibal M, Montani I, Van Landeghem S, Boyd A. spaCy: Industrial-strength Natural Language Processing in Python. 2020.
Gupta S, et al. Factors Associated with Death in critically ill patients with Coronavirus Disease 2019 in the US. JAMA Intern Med. 2020;180:1436–47.
di Flora DC et al. Analysis of Plasma Proteins Involved in Inflammation, Immune Response/Complement System, and Blood Coagulation upon Admission of COVID-19 Patients to Hospital May Help to Predict the Prognosis of the Disease. Cells. 2023;12.
Völlmy F et al. A serum proteome signature to predict mortality in severe COVID-19 patients. Life Sci Alliance. 2021;4.
Bhatraju PK et al. Covid-19 in critically ill patients in the Seattle Region - Case Series. N Engl J Med. 2020.
Gutmann C, et al. SARS-CoV-2 RNAemia and proteomic trajectories inform prognostication in COVID-19 patients admitted to intensive care. Nat Commun. 2021;12:3406.
Alfraij A, et al. Characteristics and outcomes of coronavirus disease 2019 (COVID-19) in critically ill pediatric patients admitted to the intensive care unit: a multicenter retrospective cohort study. J Infect Public Health. 2021;14:193–200.
Alharthy A, et al. Clinical characteristics and predictors of 28-Day mortality in 352 critically ill patients with COVID-19: a retrospective study. J Epidemiol Glob Health. 2021;11:98–104.
COVID-ICU G. Clinical characteristics and day-90 outcomes of 4244 critically ill adults with COVID-19: a prospective cohort study. Intensive Care Med. 2021;47:60–73.
Zehentner BK, Carter D. Mammaglobin: a candidate diagnostic marker for breast cancer. Clin Biochem. 2004;37:249–57.
Han J-H, et al. Mammaglobin expression in Lymph Nodes is an important marker of metastatic breast carcinoma. Arch Pathol Lab Med. 2003;127:1330–4.
Fricker LD, et al. Identification and characterization of proSAAS, a Granin-Like neuroendocrine peptide precursor that inhibits Prohormone Processing. J Neurosci. 2000;20:639–48.
Law RH, et al. An overview of the serpin superfamily. Genome Biol. 2006;7:216.
Pertzov B, et al. Lower serum alpha 1 antitrypsin levels in patients with severe COVID-19 compared with patients hospitalized due to non-COVID-19 pneumonia. Infect Dis (Lond). 2022;54:846–51.
Azouz NP, et al. Alpha 1 antitrypsin is an inhibitor of the SARS-CoV-2-Priming protease TMPRSS2. Pathog Immun. 2021;6:55–74.
Rosendal E, et al. Serine protease inhibitors restrict host susceptibility to SARS-CoV-2 infections. mBio. 2022;13:e0089222.
Rodríguez Hermosa JL et al. Severe COVID-19 Illness and α1-Antitrypsin Deficiency: COVID-AATD Study. Biomedicines. 2023;11.
McElvaney OJ, et al. A randomized, double-blind, placebo-controlled trial of intravenous alpha-1 antitrypsin for ARDS secondary to COVID-19. Med. 2022;3:233–e248236.
Boëlle P-Y, Debray D, Guillot L, Corvol H, on behalf of the French CFMGSI. SERPINA1 Z allele is associated with cystic fibrosis liver disease. Genet Sci. 2019;21:2151–5.
Narayanan P, Mistry PK. Update on Alpha-1 Antitrypsin Deficiency in Liver Disease. Clin Liver Dis (Hoboken). 2020;15:228–35.
Lomas DA, Li-Evans D, Finch JT, Carrell RW. The mechanism of Z α1-antitrypsin accumulation in the liver. Nature. 1992;357:605–7.
Yang P, et al. Alpha1-Antitrypsin Deficiency Carriers, Tobacco smoke, Chronic Obstructive Pulmonary Disease, and Lung Cancer Risk. Arch Intern Med. 2008;168:1097–103.
Abboud RT, Nelson TN, Jung B, Mattman A. Alpha1-antitrypsin deficiency: a clinical-genetic overview. Application Clin Genet. 2011;4:55–65.
Strnad P, McElvaney NG, Lomas DA. Alpha1-Antitrypsin Deficiency. N Engl J Med. 2020;382:1443–55.
Rodríguez-García C, et al. Is SARS-COV-2 associated with alpha-1 antitrypsin deficiency? J Thorac Dis. 2023;15:711–7.
Sezgin Y, Becel S, Kaplan AK. Comparison of COVID-19 outcomes with Alpha-1 Antitrypsin Deficiency Prevalence in Europe: a cross-sectional study. Cureus. 2023;15:e34293.
He L, Vicente CP, Westrick RJ, Eitzman DT, Tollefsen DM. Heparin cofactor II inhibits arterial thrombosis after endothelial injury. J Clin Investig. 2002;109:213–9.
Zheng Y, et al. SARS-CoV-2 spike protein causes blood coagulation and thrombosis by competitive binding to heparan sulfate. Int J Biol Macromol. 2021;193:1124–9.
Toomer KH, et al. SARS-CoV-2 infection results in upregulation of plasminogen activator Inhibitor-1 and Neuroserpin in the lungs, and an increase in fibrinolysis inhibitors associated with disease severity. EJHaem. 2023;4:324–38.
El-Hefnawy SM et al. Potential impact of serpin peptidase inhibitor clade (A) member 4 SERPINA4 (rs2093266) and SERPINA5 (rs1955656) genetic variants on COVID-19 induced acute kidney injury. 2022;32:101023.
Demichev V, et al. A proteomic survival predictor for COVID-19 patients in intensive care. PLOS Digit Health. 2022;1:e0000007.
Kurowska P et al. Review: Vaspin (SERPINA12) Expression and Function in Endocrine Cells. Cells. 2021;10.
Wilczynski SA, Wenceslau CF, McCarthy CG, Webb RC. A Cytokine/Bradykinin storm comparison: what is the relationship between hypertension and COVID-19? Am J Hypertens. 2021;34:304–6.
Chang R, Elhusseiny KM, Yeh YC, Sun WZ. COVID-19 ICU and mechanical ventilation patient characteristics and outcomes-A systematic review and meta-analysis. PLoS ONE. 2021;16:e0246318.
Kaafarani HMA, et al. Gastrointestinal complications in critically ill patients with COVID-19. Ann Surg. 2020;272:e61–2.
Zhang Y et al. Upregulation of FEN1 Is Associated with the Tumor Progression and Prognosis of Hepatocellular Carcinoma. Dis Markers 2020;2020:2514090.
Liu L, et al. Functional FEN1 genetic variants contribute to risk of hepatocellular carcinoma, esophageal cancer, gastric cancer and colorectal cancer. Carcinogenesis. 2012;33:119–23.
Tian K, et al. AntiV-SGN: a universal antiviral strategy to combat both RNA and DNA viruses by destroying their nucleic acids without sequence limitation. Microb Biotechnol. 2022;15:2488–501.
Selvaraj G, Kaliamurthi S, Peslherbe GH, Wei DQ. Identifying potential drug targets and candidate drugs for COVID-19: biological networks and structural modeling approaches. F1000Res. 2021;10:127.
Liu CH, et al. Analysis of protein-protein interactions in cross-talk pathways reveals CRKL protein as a novel prognostic marker in hepatocellular carcinoma. Mol Cell Proteom. 2013;12:1335–49.
Lan B, et al. Downregulation of CRKL expression can inhibit tumorigenesis in colon cancer. Front Biosci (Landmark Ed). 2014;19:528–34.
Kolachala VL, et al. Epithelial-derived fibronectin expression, signaling, and function in intestinal inflammation. J Biol Chem. 2007;282:32965–73.
Pujades C, Forsberg E, Enrich C, Johansson S. Changes in cell surface expression of fibronectin and fibronectin receptor during liver regeneration. J Cell Sci. 1992;102(Pt 4):815–20.
Liu XY, et al. Fibronectin expression is critical for liver fibrogenesis in vivo and in vitro. Mol Med Rep. 2016;14:3669–75.
Karadaş Ö, Öztürk B, Sonkaya AR. A prospective clinical study of detailed neurological manifestations in patients with COVID-19. Neurol Sci. 2020;41:1991–5.
Di Carlo DT, et al. Exploring the clinical association between neurological symptoms and COVID-19 pandemic outbreak: a systematic review of current literature. J Neurol. 2021;268:1561–9.
Mao L, et al. Neurologic manifestations of hospitalized patients with Coronavirus Disease 2019 in Wuhan, China. JAMA Neurol. 2020;77:683–90.
Sheng M, Kim E. The Shank family of scaffold proteins. J Cell Sci. 2000;113(Pt 11):1851–6.
Shi R et al. Shank Proteins Differentially Regulate Synaptic Transmission. eNeuro 2017;4.
Bae N, et al. Knock-down of protein L-isoaspartyl O-methyltransferase increases β-amyloid production by decreasing ADAM10 and ADAM17 levels. Acta Pharmacol Sin. 2011;32:288–94.
Li W-X, et al. Systematic metabolic analysis of potential target, therapeutic drug, diagnostic method and animal model applicability in three neurodegenerative diseases. Aging. 2020;12:9882–914.
Zhang Y, et al. Elevated expression of DJ-1 (encoded by the human PARK7 gene) protects neuronal cells from sevoflurane-induced neurotoxicity. Cell Stress Chaperones. 2018;23:967–74.
Lind-Holm Mogensen F, Scafidi A, Poli A, Michelucci A. PARK7/DJ-1 in microglia: implications in Parkinson’s disease and relevance as a therapeutic target. J Neuroinflammation. 2023;20:95.
Peng L, et al. Effect of DJ-1 on the neuroprotection of astrocytes subjected to cerebral ischemia/reperfusion injury. J Mol Med (Berl). 2019;97:189–99.
Antipova D, Bandopadhyay R. Expression of DJ-1 in Neurodegenerative disorders. Adv Exp Med Biol. 2017;1037:25–43.
Hu S, et al. Molecular chaperones and Parkinson’s disease. Neurobiol Dis. 2021;160:105527.
Huang M, Chen S. DJ-1 in neurodegenerative diseases: Pathogenesis and clinical application. Prog Neurobiol. 2021;204:102114.
Singh Y, et al. DJ-1 (Park7) affects the gut microbiome, metabolites and the development of innate lymphoid cells (ILCs). Sci Rep. 2020;10:16131.
Pap D, Veres-Székely A, Szebeni B, Vannay Á. PARK7/DJ-1 as a therapeutic target in Gut-Brain Axis diseases. Int J Mol Sci. 2022;23.
Kanuru M, Aradhyam GK. Chaperone-like activity of Calnuc prevents amyloid aggregation. Biochemistry. 2017;56:149–59.
Bonito-Oliva A, Barbash S, Sakmar TP, Graham WV. Nucleobindin 1 binds to multiple types of pre-fibrillar amyloid and inhibits fibrillization. Sci Rep. 2017;7:42880.
Kimberly WT, Xia W, Rahmati T, Wolfe MS, Selkoe DJ. The transmembrane aspartates in presenilin 1 and 2 are obligatory for gamma-secretase activity and amyloid beta-protein generation. J Biol Chem. 2000;275:3173–8.
Tu H, et al. Presenilins form ER Ca2 + leak channels, a function disrupted by familial Alzheimer’s disease-linked mutations. Cell. 2006;126:981–93.
Berridge MJ. Calcium hypothesis of Alzheimer’s disease. Pflügers Archiv - Eur J Physiol. 2010;459:441–9.
Zampese E, et al. Presenilin 2 modulates endoplasmic reticulum (ER)-mitochondria interactions and Ca2 + cross-talk. Proc Natl Acad Sci U S A. 2011;108:2777–82.
Hoshino A, et al. A novel function for proSAAS as an amyloid anti-aggregant in Alzheimer’s disease. J Neurochem. 2014;128:419–30.
Shakya M, Yildirim T, Lindberg I. Increased expression and retention of the secretory chaperone proSAAS following cell stress. Cell Stress Chaperones. 2020;25:929–41.
Jarvela TS et al. The neural chaperone proSAAS blocks α-synuclein fibrillation and neurotoxicity. Proceedings of the National Academy of Sciences. 2016;113:E4708-E4715.
Zhu S, et al. Galanin family peptides: molecular structure, expression and roles in the neuroendocrine axis and in the spinal cord. Front Endocrinol (Lausanne). 2022;13:1019943.
Guglietti B, Sivasankar S, Mustafa S, Corrigan F, Collins-Praino LE. Fyn kinase activity and its role in neurodegenerative Disease Pathology: a potential Universal Target? Mol Neurobiol. 2021;58:5986–6005.
Kaufman AC, et al. Fyn inhibition rescues established memory and synapse loss in Alzheimer mice. Ann Neurol. 2015;77:953–71.
Nygaard HB, van Dyck CH, Strittmatter SM. Fyn kinase inhibition as a novel therapy for Alzheimer’s disease. Alzheimers Res Ther. 2014;6:8.
Kumar M, et al. Abstract 11: role of pro-saas peptides in Salt Resistance. Hypertension. 2020;76:A11–11.
Singh AK, Gupta R, Ghosh A, Misra A. Diabetes in COVID-19: prevalence, pathophysiology, prognosis and practical considerations. Diabetes Metabolic Syndrome: Clin Res Reviews. 2020;14:303–10.
Kamrath C, et al. Incidence of type 1 diabetes in children and adolescents during the COVID-19 pandemic in Germany: results from the DPV Registry. Diabetes Care. 2022;45:1762–71.
D’Souza D, et al. Incidence of diabetes in children and adolescents during the COVID-19 pandemic: a systematic review and Meta-analysis. JAMA Netw Open. 2023;6:e2321281–2321281.
Mootha VK et al. Errα and Gabpa/b specify PGC-1α-dependent oxidative phosphorylation gene expression that is altered in diabetic muscle. Proceedings of the National Academy of Sciences. 2004;101: 6570–6575.
Patch RJ, et al. Indazole-based ligands for estrogen-related receptor α as potential anti-diabetic agents. Eur J Med Chem. 2017;138:830–53.
Larsen LH, et al. Genetic analysis of the estrogen-related receptor α and studies of association with obesity and type 2 diabetes. Int J Obes. 2007;31:365–70.
Handschin C, Mootha VK. Estrogen-related receptor α (ERRα): a novel target in type 2 diabetes. Drug Discovery Today: Therapeutic Strategies. 2005;2:151–6.
Seely J, et al. Transcriptional regulation of Dehydroepiandrosterone Sulfotransferase (SULT2A1) by estrogen-related receptor α. Endocrinology. 2005;146:3605–13.
Tripathi M, Yen PM, Singh BK. Estrogen-related receptor alpha: an under-appreciated potential target for the treatment of metabolic diseases. Int J Mol Sci. 2020;21:1645.
Audet-walsh É, Giguére V. The multiple universes of estrogen-related receptor α and γ in metabolic control and related diseases. Acta Pharmacol Sin. 2015;36:51–61.
Karabulut Uzunçakmak S, Aksakal A, Kerget F, Aydın P, Halıcı Z. Evaluation of IGFBP5 expression and plasma osteopontin level in COVID-19 patients. Adv Med Sci. 2023;68:31–7.
Landau D, et al. Expression of insulin-like growth factor binding proteins in the rat kidney: effects of long-term diabetes. Endocrinology. 1995;136:1835–42.
Lee J-H, et al. Identification of pre-diabetic biomarkers in the progression of diabetes Mellitus. Biomedicines. 2022;10:72.
Owerbach D, et al. Analysis of candidate genes for susceptibility to type I diabetes: a case-control and family-association study of genes on chromosome 2q31-35. Diabetes. 1997;46:1069–74.
Rodgers BD, Bautista RM, Nicoll CS. Regulation of insulin-like growth factor-binding proteins in rats with insulin-dependent diabetes mellitus. Proc Soc Exp Biol Med. 1995;210:234–41.
Song C, et al. IGFBP5 promotes diabetic kidney disease progression by enhancing PFKFB3-mediated endothelial glycolysis. Cell Death Dis. 2022;13:340.
von Loeffelholz C, et al. Circulating vaspin is unrelated to insulin sensitivity in a cohort of nondiabetic humans. Eur J Endocrinol. 2010;162:507–13.
Feng R, et al. Higher vaspin levels in subjects with obesity and type 2 diabetes mellitus: a meta-analysis. Diabetes Res Clin Pract. 2014;106:88–94.
Youn B-S, et al. Serum vaspin concentrations in human obesity and type 2 diabetes. Diabetes. 2008;57:372–7.
Jian W, et al. Role of serum vaspin in progression of type 2 diabetes: a 2-Year Cohort Study. PLoS ONE. 2014;9:e94763.
Klöting N, et al. Vaspin gene expression in human adipose tissue: Association with obesity and type 2 diabetes. Biochem Biophys Res Commun. 2006;339:430–6.
Williams P, Tulke S, Ilegems E, Berggren P-O, Broberger C. Expression of nucleobindin 1 (NUCB1) in pancreatic islets and other endocrine tissues. Cell Tissue Res. 2014;358:331–42.
Gupta R, Kapoor N, Raleigh DP, Sakmar TP. Nucleobindin 1 caps human islet amyloid polypeptide protofibrils to prevent amyloid fibril formation. J Mol Biol. 2012;421:378–89.
Inberg A, Linial M. Protection of pancreatic beta-cells from various stress conditions is mediated by DJ-1. J Biol Chem. 2010;285:25686–98.
Jain D, et al. DJ-1 protects pancreatic Beta cells from cytokine- and streptozotocin-mediated cell death. PLoS ONE. 2015;10:e0138535.
Kim JM, et al. DJ-1 contributes to adipogenesis and obesity-induced inflammation. Sci Rep. 2014;4:4805.
Eberhard D, Lammert E. The role of the antioxidant protein DJ-1 in type 2 diabetes Mellitus. Adv Exp Med Biol. 2017;1037:173–86.
Jain D, et al. Age- and diet-dependent requirement of DJ-1 for glucose homeostasis in mice with implications for human type 2 diabetes. J Mol Cell Biol. 2012;4:221–30.
Jaikaran ET, et al. Localisation of presenilin 2 in human and rodent pancreatic islet beta-cells; Met239Val presenilin 2 variant is not associated with diabetes in man. J Cell Sci. 1999;112(Pt 13):2137–44.
Li Y, et al. A Presenilin/Notch1 pathway regulated by miR-375, miR-30a, and miR-34a mediates glucotoxicity induced-pancreatic beta cell apoptosis. Sci Rep. 2016;6:36136.
Ackermann M et al. Pulmonary vascular endothelialitis, thrombosis, and Angiogenesis in Covid-19. N Engl J Med. 2020.
Lodigiani C, et al. Venous and arterial thromboembolic complications in COVID-19 patients admitted to an academic hospital in Milan, Italy. Thromb Res. 2020;191:9–14.
Varga Z, et al. Endothelial cell infection and endotheliitis in COVID-19. Lancet. 2020;395:1417–8.
Klok FA, et al. Incidence of thrombotic complications in critically ill ICU patients with COVID-19. Thromb Res. 2020;191:145–7.
Middeldorp S, et al. Incidence of venous thromboembolism in hospitalized patients with COVID-19. J Thromb Haemost. 2020;18:1995–2002.
Helms J, et al. High risk of thrombosis in patients with severe SARS-CoV-2 infection: a multicenter prospective cohort study. Intensive Care Med. 2020;46:1089–98.
Heineke J, et al. Cardiomyocyte GATA4 functions as a stress-responsive regulator of angiogenesis in the murine heart. J Clin Investig. 2007;117:3198–210.
Suzuki YJ, Nagase H, Day RM, Das DK. GATA-4 regulation of myocardial survival in the preconditioned heart. J Mol Cell Cardiol. 2004;37:1195–203.
Dittrich GM, et al. Fibroblast GATA-4 and GATA-6 promote myocardial adaptation to pressure overload by enhancing cardiac angiogenesis. Basic Res Cardiol. 2021;116:26.
Hwang JR, et al. The C-terminus of IGFBP-5 suppresses tumor growth by inhibiting angiogenesis. Sci Rep. 2016;6:39334.
Duan C, Clemmons DR. Differential expression and biological effects of insulin-like growth factor-binding protein-4 and – 5 in vascular smooth muscle cells. J Biol Chem. 1998;273:16836–42.
Rho SB, et al. Insulin-like growth factor-binding protein-5 (IGFBP-5) acts as a tumor suppressor by inhibiting angiogenesis. Carcinogenesis. 2008;29:2106–11.
Struyf S, Burdick MD, Proost P, Van Damme J, Strieter RM. Platelets release CXCL4L1, a nonallelic variant of the chemokine platelet factor-4/CXCL4 and potent inhibitor of angiogenesis. Circ Res. 2004;95:855–7.
Vandercappellen J, Van Damme J, Struyf S. The role of the CXC chemokines platelet factor-4 (CXCL4/PF-4) and its variant (CXCL4L1/PF-4var) in inflammation, angiogenesis and cancer. Cytokine Growth Factor Rev. 2011;22:1–18.
Sarabi A, et al. CXCL4L1 inhibits angiogenesis and induces undirected endothelial cell migration without affecting endothelial cell proliferation and monocyte recruitment. J Thromb Haemost. 2011;9:209–19.
Brandhofer M, et al. Heterocomplexes between the atypical chemokine MIF and the CXC-motif chemokine CXCL4L1 regulate inflammation and thrombus formation. Cell Mol Life Sci. 2022;79:512.
Arakawa N, et al. Serum stratifin and presepsin as candidate biomarkers for early detection of COVID-19 disease progression. J Pharmacol Sci. 2022;150:21–30.
Miao RQ, Agata J, Chao L, Chao J. Kallistatin is a new inhibitor of angiogenesis and tumor growth. Blood. 2002;100:3245–52.
Huang KF, et al. Kallistatin, a novel anti-angiogenesis agent, inhibits angiogenesis via inhibition of the NF-κB signaling pathway. Biomed Pharmacother. 2014;68:455–61.
Chao J, Bledsoe G, Chao L. Protective role of Kallistatin in Vascular and Organ Injury. Hypertension. 2016;68:533–41.
Cheng N, Brantley DM, Chen J. The ephrins and eph receptors in angiogenesis. Cytokine Growth Factor Rev. 2002;13:75–85.
Füller T, Korff T, Kilian A, Dandekar G, Augustin HG. Forward EphB4 signaling in endothelial cells controls cellular repulsion and segregation from ephrinB2 positive cells. J Cell Sci. 2003;116:2461–70.
Du E, Li X, He S, Li X, He S. The critical role of the interplays of EphrinB2/EphB4 and VEGF in the induction of angiogenesis. Mol Biol Rep. 2020;47:4681–90.
Groppa E, et al. EphrinB2/EphB4 signaling regulates non-sprouting angiogenesis by VEGF. EMBO Rep. 2018;19:e45054.
Palma Medina LM, et al. Targeted plasma proteomics reveals signatures discriminating COVID-19 from sepsis with pneumonia. Respir Res. 2023;24:62.
Rovas A, et al. Microvascular and proteomic signatures overlap in COVID-19 and bacterial sepsis: the MICROCODE study. Angiogenesis. 2022;25:503–15.
Urbiola-Salvador V, Lima de Souza S, Grešner P, Qureshi T, Chen Z. Plasma proteomics unveil Novel Immune signatures and biomarkers upon SARS-CoV-2 infection. Int J Mol Sci. 2023;24:6276.
Dos Santos F et al. Plasma enzymatic activity, proteomics and peptidomics in COVID-19-induced sepsis: a novel approach for the analysis of hemostasis. Front Mol Biosci. 2023;9.
Filbin MR et al. Longitudinal proteomic analysis of severe COVID-19 reveals survival-associated signatures, tissue-specific cell death, and cell-cell interactions. Cell Rep Med. 2021;2.
Batra R, et al. Multi-omic comparative analysis of COVID-19 and bacterial sepsis-induced ARDS. PLoS Pathog. 2022;18:e1010819.
Acknowledgements
We are grateful to the front-line healthcare providers who assisted with the collection of blood samples from acutely ill COVID-19 and non-COVID-19 sepsis controls, and the Translational Research Centre (https://translationalresearchcentre.com/) for providing blood samples from healthy control subjects.
Funding
DDF received study funding from the London Health Sciences Foundation (https://lhsf.ca/), the London Community Foundation and the AMOSO Innovation Fund. MAP received studentship funding from the Canadian Institutes of Health Research (Funding #: 187684).
Author information
Authors and Affiliations
Contributions
DDF conceived and designed the study. DDF, LRVN, and MS collected human samples and clinical data. DDF, MAP, MD and GC analyzed all data. MAP and DDF wrote the manuscript with input from all other authors.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
This study was approved by the Western University, Human Research Ethics Board (HREB). Given the unprecedented pandemic situation and the restricted hospital access for substitute decision makers, waived consent was approved for a short, defined period of time (Research Ethics Board [REB] ID# 1670; issued March 20, 2020), and in keeping with the Society for Critical Care Medicine statement on “Waiver of Informed Consent in Emergency Situations”. The last patient enrolled under waived consent was May 1, 2020.
Consent for publication
Not Applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Patel, M.A., Daley, M., Van Nynatten, L.R. et al. A reduced proteomic signature in critically ill Covid-19 patients determined with plasma antibody micro-array and machine learning. Clin Proteom 21, 33 (2024). https://doi.org/10.1186/s12014-024-09488-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12014-024-09488-3