Background

Alzheimer’s disease (AD) is widely recognized as the most common etiology of dementia [1] and is currently ranked as the sixth most prevalent cause of mortality in the United States. The global prevalence of AD is expected to rise to 135 million individuals by the year 2050 [1].

There is currently no definitive diagnostic test or biomarker for the disease, which means that diagnosis often involves ruling out other causes of cognitive decline [2]. Several biomarkers have been identified that can potentially be used for diagnosing AD in its early stages. The four main biomarkers found in cerebrospinal fluid (CSF), i.e., amyloid beta (Aβ)1–42, Aβ42/40 ratio, Tau, and phosphorylated-Tau (p-Tau)181, are reliable for supporting AD diagnosis as they indicate the hallmark AD pathologies of amyloidosis and neurodegeneration [3]. Although these CSF biomarkers are reliable for supporting AD diagnostics, the process of collecting CSF can be inconvenient for subjects and may cause procedural efforts. This prevents their use as a screening item in initial, asymptomatic subjects and makes repetitive monitoring of the disease progression challenging. Therefore, there is a significant necessity to develop blood-based markers that can provide targeted and fairly noninvasive screening tests in the right context of clinical application [4].

Evidence suggests that the neurofilament light chain (NFL) levels, a marker of cytoskeletal protein that rises in CSF and serum following neuroaxonal impairment, increase in individuals with AD [5]. In that line, according to a recent report by Mattsson et al., it was found that plasma NFL can be reliably regarded as a neurodegeneration biomarker in AD [6]. Nevertheless, NFL is not specific to AD, and its concentration similarly increases in other forms of neurodegenerative diseases including vascular and frontotemporal dementias [7]. The insensitivity of classical ELISA methods in accurately detecting trace NFL levels in circulation is another issue that needs to be addressed in the field [8].

Recently, ultrasensitive single molecule array assays (SIMOA) have been used to measure CSF and blood biomarkers. SIMOA technique can measure low concentrations of Aβ, p-Tau, and NFL in blood samples, reflecting those levels measured in CSF [9]. Therefore, accurate measurement of NFL levels in plasma using ultra-sensitive technics such as SIMOA could be a significant step in the early diagnosis of AD [10].

Here, this meta-analysis aimed to evaluate the diagnostic accuracy of plasma NFL levels measured by SIMOA in distinguishing between AD, MCI, and healthy controls.

Methods

Search strategy

The following databases were searched for published articles in English from inception until February 2023: Scopus, Google Scholar, Web of Science, and PUBMED. Our search used the following keywords: Alzheimer’s disease, AD, mild cognitive impairment, MCI, single molecule array, SIMOA, plasma neurofilament light, and NFL as follows: ((((("Alzheimer"[Title/Abstract]) OR ("Alzheimer’s"[Title/Abstract])) OR ("AD"[Title/Abstract])) AND (("neurofilament light") OR ("Nfl"))) AND (("MCI"[Title/Abstract]) OR ("Mild cognitive impairment"[Title/Abstract]))) AND ((("Serum") OR ("Plasma")) OR ("Blood")). In addition, we searched references of selected articles to identify potentially-related studies. The results were reported based on Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement criteria [11]. Two investigators independently searched the databases and in case of any inconsistencies, a senior researcher judged the articles against the inclusion and exclusion criteria, and approved the final list of articles.

Study selection

Studies were considered for inclusion if they met all of the following criteria: 1) Articles reported plasma NFL levels in controls, AD, and MCI patients; 2) Plasma NFL levels were measured by SIMOA; 3) Human studies without restriction in age range and sample size; 4) Clinical studies, longitudinal studies such as cohort or case–control, retrospective or prospective; 5) Studies written in English; 6) Availability of information for each study, including sample size and diagnostic criteria for AD and MCI.

After article identification and review, studies without sufficient data and/or those focusing on other types of diseases (e.g., Parkinson’s disease, Down syndrome, frontotemporal dementia, and vascular dementia) were excluded. Additionally, this study excluded duplicate articles, reviews, case reports, and irrelevant papers.

Data extraction and critical appraisal

Two researchers independently extracted the data from each study, and any disagreement was reviewed by a third author if it was needed. From each included study, the following data were extracted: first author, publication year, age of patients, number of participants, plasma NFL levels, and measuring method of the biomarker. Quality assessment of the studies was accomplished by Joanna Bridge Institute (JBI) critical appraisal tools for cross-sectional, cohort, and case–control studies.

Statistical analysis

The Comprehensive Meta-Analysis Software (CMA) software version 2.0 was used to analyze the data. All data were stated as mean ± standard deviation (SD). Meta-analysis was used if three or more articles assessed a comparable intervention by the fixed-effects model, as their heterogeneity was rather low [12]. The standardized mean difference (SMD) was used to compare the means of the groups in each publication. The I2 statistic was found not to be suitable for evaluating the variability of the effect size in our data, so we did not use it to report heterogeneity. We believe that its intended purpose does not include measuring the extent of effect size differences, and it cannot provide this information unless the I2 value is zero. Instead, we utilized the prediction interval to assess variability (heterogeneity) in our data [13, 14]. Publication bias was calculated using funnel plot and trim and fill analysis which allows the estimation of an adjusted meta-analysis estimate in the presence of publication bias [15]. A p-value of less than 0.05 was considered statistically significant.

Results

This study evaluated the results emerging from 15 publications. All publications assessed the relationship between NFL plasma concentrations and AD and/or MCI diagnosis. The study selection process is depicted in Fig. 1. In the first step, we found 347 articles from different databases. After removing the duplicates, 170 studies were screened based on title and abstract. Then, of the remaining 44 studies, 29 studies were excluded from full-text evaluations. Accordingly, we assessed the remaining studies based on the eligibility criteria of this systematic review. Fifteen out of 44 studies met the eligibility criteria and were included in our systematic review and meta-analysis.

Fig. 1
figure 1

PRISMA flow diagram. Literature selection summary based on the PRISMA guidelines depicting the number of inclusions and exclusions from the initial search. * Review articles, insufficient NFL data, irrelevant articles, and other types of dementia

Demographic features

A total number of 4625 patients were included in the systematic review and quantitative meta-analysis. 1439 participants (31.11%) were diagnosed with MCI, while 1420 (30.70%) had AD. 1766 (38.18%) participants were evaluated as controls of the meta-analysis. Additional information of each study is presented in Table 1.

Table 1 Characteristics of the included studies in the systematic review and meta-analysis

Characteristics of the included studies

Fifteen studies met the inclusion criteria. The number of participants ranged from 40 to 579. The mean age of participants in the AD group ranged from 56.58 ± 4.18 to 79.39 ± 6.66, in the MCI group ranged from 62.41 ± 11.36 to 77.3 ± 5.1, and in control group ranged from 52.5 ± 13.1 to 77.0 ± 6.2.

The reported diagnosis criteria for AD were as follows: Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition, (DSM-IV) [30], National Institute of Neurological and Communicative Disorders and Stroke–Alzheimer’s Disease and Related Disorders Association (NINCDS-ADRDA), or National Institute on Aging-Alzheimer’s Association (NIA-AA) criteria [31]. Also clinical/cognitive assessments for AD patients were done using following tests: Mini-Mental State Examination (MMSE), Clinical Dementia Rating (CDR), Alzheimer Disease Assessment Scale–cognitive subscale (ADAS-Cog 11), Wechsler Memory Scale (WMS) logical memory II, Trail-Making test part B (TMT-B), Wechsler Adult Intelligence Scale–Revised (WAIS-R) digit symbol substitution test.

Also, the diagnosis criteria/tests used for MCI patients were Petersen criteria for MCI [32], NINCDS-ADRDA criteria, NIA-AA criteria, ADAS-Cog 11, MMSE, CDR, WMS logical memory II, TMT-B and WAIS-R digit symbol substitution test.

Of fifteen studies, thirteen studies were cross-sectional, one study was cohort and one study was case–control. Additional information is reported in Table 1.

NFL plasma concentration in MCI vs control

Analysis of 13 articles revealed that the serum NFL level was significantly higher in the MCI group compared with the control group (0.361 SDM, 95% CI 0.286–0.435, p = 0.000, I2 = 49.179) (see Fig. 2). Trim and fill analysis showed the absence of two studies in the MCI-control comparison. Upon adjusting SMD, the values changed to 0.337 (95% CI 0.264–0.410).

Fig. 2
figure 2

Forest plot of standardized mean difference (SMD) in MCI-control. The red square shows the overall pooled effect. Black squares indicate the SMD in each study. Horizontal lines represent a 95% confidence interval (CI)

NFL plasma concentration in AD vs control

The AD group exhibited a significantly higher serum NFL level than the control group (0.808 SDM, 95% CI 0.727–0.888, p = 0.000, I2 = 39.433) (see Fig. 3). Trim and fill analysis identified two missing studies in the AD-control comparison. Upon adding these publications, the adjusted estimate of average efficacy in AD control decreased to 0.786 (95% CI 0.707–0.865).

Fig. 3
figure 3

Forest plot of standardized mean difference (SMD) in AD-control. The red square shows the overall pooled effect. Black squares indicate the SMD in each study. Horizontal lines represent a 95% confidence interval (CI)

NFL plasma concentration in MCI vs AD

Through analysis of 14 articles, we observed that the serum NFL level was significantly higher in the AD group than in the MCI (0.436 SDM, 95% CI 0.359–0.513, p = 0.00, I2 = 37.44) (see Fig. 4). Trim and fill analysis indicated the presence of one missing study in the MCI-control comparison. After adjusting SMD, the value changed to 0.425 (95% CI 0.350–0.501).

Fig. 4
figure 4

Forest plot of standardized mean difference (SMD) in AD-MCI. The red square shows the overall pooled effect. Black squares indicate the SMD in each study. Horizontal lines represent a 95% confidence interval (CI)

Risk of bias across studies

The JBI critical appraisal tool for cross-sectional, cohort, and case–control studies was utilized to assess the risk of bias in the included studies. The tool indicated no significant risk of bias across the cross-sectional, case–control, and cohort studies included in this analysis (see Tables 2, 3 and 4, respectively).

Table 2 The JBI critical appraisal tool for cross-sectional studies
Table 3 The JBI critical appraisal tool for cohort studies
Table 4 The JBI critical appraisal tool for case–control studies

Heterogeneity of the data

The prediction interval analysis revealed a mean effect size of 0.44, with a confidence interval of 0.36 to 0.51. Furthermore, it indicated that in 95% of comparable populations, the true effect size ranged from 0.17 to 0.70. The findings showed significant heterogeneity among the analyzed publications, as demonstrated by the prediction interval, which displayed a wider range of potential treatment outcomes compared to the confidence interval. The potential sources of heterogeneity may be associated with study design, patient characteristics, assay methods, or statistical approaches. The funnel plot analysis included 15 publications and showed a symmetric plot of standard error versus intervention effect, indicating no significant publication bias among the studies (see Fig. 5).

Fig. 5
figure 5

Publication bias. Funnel plot for NFL concentration in included studies

Discussion

Our findings confirm that plasma NFL levels are significantly higher in AD patients than in MCI patients and healthy controls. The values are also higher in MCI patients than in healthy controls. The quality of the identified studies was found to be high based on JBI critical appraisal. However, the heterogeneity of the data was also shown to be high.

Two important issues were taken into account in this meta-analysis. First, our study evaluated the results emerging from publications that used plasma levels of NFL to differentiate between MCI, AD, and healthy controls. While the advantages of CSF biomarkers for AD diagnosis are well-known, there are also some disadvantages such as the invasive procedure required for obtaining CSF and inter-laboratory standardization and reproducibility of CSF biomarker measurements [33]. In contrast, blood biomarkers for AD are gaining attention because they are less invasive and easier to obtain than CSF biomarkers with the potential for earlier and more convenient detection of AD [34]. Additionally, blood samples can be stored and transported more easily than CSF samples, which must be analyzed within a few hours of collection. This allows for wider accessibility and feasibility of blood-based biomarker testing, especially in remote areas or regions with limited healthcare infrastructure. Furthermore, blood-based biomarker testing is generally more cost-effective compared to CSF-based testing, making it a more accessible option for patients [35].

In addition, due to the high cost and the unavailability of imaging methodes such as positron emission tomography (PET), and magnetic resonance imaging (MRI), detecting the blood biomarkers related to AD, offer greater convenience for simultaneously screening a large cohort of individuals [36].

Second, this meta-analysis only included studies that used SIMOA-based detection of NFL in plasma samples of MCI and AD patients and healthy controls. In the past, NFL was measured using ELISA and sensitive electro-chemiluminescence (ECL) methods. However, with the introduction of SIMOA, the measurement of NFL in blood samples became 125 and 26 times more sensitive than ELISA and ECL, respectively [9]. Besides, SIMOA is highly specific and can discriminate between closely related molecules with a high degree of accuracy. This method reduces the requirement for CSF collection and permits more frequent measurement due to the ease of obtaining blood samples [37]. This is important for the detection of complex biomarkers, such as those involved in neurodegenerative diseases, where subtle differences in biomarker isoforms can have diagnostic significance [38]. Also, SIMOA has a wide dynamic range and can measure biomarker levels over a large range of concentrations. This makes it possible to measure biomarker levels in both healthy and diseased populations, increasing the clinical utility of the technology [38].

Improving ultra-sensitive technologies has been crucial in propelling the field forward. SIMOA is a complex technology that requires specialized instrumentation and expertise for operation. From a research standpoint, SIMOA is the most well-established technology for detecting ultrasensitive blood-based biomarkers of AD pathology [39]. The process of translating blood-based biomarkers for AD into diagnostic biomarkers for regular use in patient care will require several stages. These stages include establishing a clear purpose, verifying the analytical and clinical performance within the intended use context, and obtaining regulatory authorization from entities [40].

The Simoa p-tau181 has been granted the Breakthrough Device designation by the FDA to assist in evaluating AD. Similarly, Simoa NfL has also received this designation for diagnosing multiple sclerosis. Moreover, certain companies provide NfL testing that is certified under the Laboratory Improvement Amendments (CLIA) [41]. Thus, there is an expectation that ultrasensitive methods like SIMOA will have great potential for clinical use in various clinical settings.

Currently, research studies have presented strong tests of blood markers to identify amyloid and tau pathologies that are unique to AD (Aβ peptides, p-tau). Additionally, there are also blood markers available for detecting non-specific neuronal (NFL, β-synuclein, ubiquitin-C-terminal-hydrolase-L1) and glial degeneration (glial fibrillary acidic protein) [10, 41, 42].

For instance, Shi et al. [43] utilized an ultrasensitive quantitative technique to illustrate the plasma concentrations of Aβ40, Aβ42, and NFL in individuals diagnosed with amnestic MCI (aMCI). In patients with aMCI, the levels of Aβ40 and Aβ42 were found to be decreased, whereas NFL levels were significantly elevated. Furthermore, increased plasma NFL concentrations were associated with reduced size of the hippocampus and total volume of gray matter in the left inferior and middle temporal gyrus. Similarly, Xiao et al. [44] assessed alterations in the levels of various plasma biomarkers, including Aβ40, Aβ42, t-Tau, NFL, and p-Tau181 in multiple stages of AD using SIMOA. As cognitive impairment progressed, levels of Aβ40, Aβ42 and Aβ42/Aβ40 decreased while levels of t-Tau, NFL and p-Tau181 increased. Among all plasma biomarkers, p-Tau181 was identified as a potential biomarker related to symptoms while NFL was considered a potential non-specific biomarker of neurodegeneration. In another study, Fortea et al. [45] assessed a population at high risk of AD and categorized the samples as asymptomatic, prodromal, or AD dementia. They measured Aβ40, Aβ42, t-Tau, p-Tau181, and NFL levels in both plasma and CSF using the SIMOA method. Plasma NFL levels were found to distinguish between the asymptomatic and prodromal groups as well as between the asymptomatic and dementia groups.

Plasma NFL levels are associated with cognitive deficits, progressive neural atrophy, and neurodegeneration in AD patients [46]. Elevated levels of plasma NFL were also appeared to be related with baseline CSF biomarkers including reduced Aβ42 and elevated total tau (t-Tau) and p-tau levels. Additionally, MRI measures, such as reduced hippocampal volumes, decreased regional cortical thickness, increased ventricular volumes, and reduced FDG-PET uptake were also associated with higher levels of plasma NFL in these patients [25, 46]. These findings were also generalizable to individuals with early-onset AD. In that light, Watson et al., found an increase in plasma NFL levels in AD patients with PSEN1 or APP mutations suggesting that NFL may assist as an initial diagnostic biomarker for early-onset AD, as plasma NFL concentrations were found to be elevated before the manifestation of symptoms (preclinical and prodromal AD) and corresponded with the severity stage of the disease [47]. Furthermore, in a prospective case–control study, Mattsson and colleagues found the precision of plasma NFL in distinguishing AD patients from healthy controls was equivalent to well-known CSF AD biomarkers and substantially higher than plasma tau [6]. These findings were duplicated in another study by Sugerman et al. where they showed that NFL is more accurate predictive factor in dementia than t-Tau, with the possibility to distinguish AD from MCI [21].

However, it’s important to note that a single plasma biomarker may not be sufficient to diagnose a disease; a combination of biomarkers could be more useful in distinguishing AD patients from healthy individuals.

Overall, SIMOA-based assay of plasma NFL levels is a promising approach in monitoring neurodegeneration in MCI and AD patients. It has the potential factor to provide less invasive and more economical alternative to current diagnostic methods. However, more research is needed to fully establish the diagnostic accuracy and clinical utility of this approach.

Limitations

Our meta-analysis had several potential limitations. First, the studies we mainly included relied on clinical criteria for diagnosing AD patients, which may result in selection bias of participants, and more studies warranted to diagnose the patient via clinical assessment and brain imaging such as PET and MRI. Second, our access to data and papers was not universal, and we only searched PUBMED, SCOPUS, Web of Science, and google scholar, which could have influenced our results. Third, the studies we analyzed had limited sample sizes, potentially affecting accuracy; therefore, more multicenter studies with larger sample sizes are required. Fourth, our meta-analysis was restricted to studies published in English, so we may have missed relevant research in other languages, which may result in selection bias in our study.

Conclusion

This meta-analysis demonstrated that by measuring NFL levels using the SIMOA method, we are able to differentiate AD and MCI patients from the healthy controls. In that light, NFL has the potential to be used as a diagnostic marker in AD patients. However, the included studies showed high heterogeneity, indicating that the outcome of this meta-analysis should be treated with caution.