The performance relationship between the EQ-5D-5L composite “Anxiety/Depression” dimension and anxiety and depression symptoms in a large, general population sample

Scott, Emily Stella; Lubetkin, Erica I.; Janssen, Mathieu F.; Yfantopolous, John N.; Bonsel, Gouke J.; Haagsma, Juanita A.

doi:10.1007/s11136-024-03754-5

The performance relationship between the EQ-5D-5L composite “Anxiety/Depression” dimension and anxiety and depression symptoms in a large, general population sample

Open access
Published: 13 September 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

Quality of Life Research Aims and scope Submit manuscript

The performance relationship between the EQ-5D-5L composite “Anxiety/Depression” dimension and anxiety and depression symptoms in a large, general population sample

Download PDF

Emily Stella Scott ORCID: orcid.org/0000-0001-7548-1775¹,
Erica I. Lubetkin²,
Mathieu F. Janssen³,
John N. Yfantopolous⁴,
Gouke J. Bonsel⁵ &
…
Juanita A. Haagsma¹

Abstract

Purpose

This cross-sectional study aims to understand the relationship between responses on the Anxiety/Depression (A/D) dimension of the EQ-5D-5L and symptoms of anxiety and depression on the GAD-7 and PHQ-9 instruments. In doing so, we investigate the comparative performance of the dimension between diagnostic groups (i.e. anxiety (GAD-7); depression (PHQ-9); anxiety & depression versus none). We additionally investigate the discriminatory performance between sub-populations based on gender, age, education and self-reported chronic conditions.

Methods

19,902 general population participants completed a health survey in May/June 2020, from five European countries and the United States. Performance of A/D was calculated using the Area Under the Receiver Operating Characteristic curve (AUROC), and was compared to having anxiety (GAD-7 ≥ 8), depression (PHQ-9 ≥ 10) and both versus none for the total population and sub-populations. Several additional sensitivity analyses were conducted, including calculations of the optimal A/D cut-off.

Results

The performance in the total sample was good (AUROC > 0.8) and did not differ significantly between diagnostic groups. The performance differed significantly between the age groups, with worse performance in the younger groups, and differed between those with a singular chronic condition, with worse performance in those indicating having an anxiety or depression disorder. The performance did not differ significantly by gender, education, nor total chronic conditions.

Conclusion

The A/D dimension captures symptoms of anxiety, depression or both equally well. Performance is worse in the younger population. Interpretation in those with a self-reported anxiety or depression disorder should be further investigated. This is the first-of-its-kind large population sample performance analysis, where we present evidence that the performance of the A/D dimension differs between ages, and thus intra-age comparative results may be flawed.

The performance of the EQ-5D-3L in screening for anxiety and depressive symptoms in hospital and community settings

Article Open access 19 March 2021

New cut-off points of PHQ-9 and its variants, in Costa Rica: a nationwide observational study

Article Open access 31 August 2023

The 14-item short health anxiety inventory (SHAI-14) used as a screening tool: appropriate interpretation and diagnostic accuracy of the Swedish version

Article Open access 14 November 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Health-related quality of life (HRQOL) is a subjective, multi-dimensional concept that constitutes physical and social functioning, pain and psychological symptoms, and more [1]. The EQ-5D-5L is a generic instrument measuring HRQOL in five short questions referring to “today” [2, 3]. These include core aspects of HRQOL that are summarised in dimensions (5D): Mobility, Self-care, Usual Activities, Pain/Discomfort and Anxiety/Depression. The EQ-5D-5L is widely applied in clinical, population and economic studies [3, 4], informing patient management and policy decisions [5]. Given its broad application, it is important the instrument is concise and easy to use, yet an accurate metric and reliable for use in (sub-)populations [5].

The Anxiety/Depression dimension (A/D) of the EQ-5D-5L covers psychological symptoms within HRQOL, as demonstrated using confirmatory factor analysis [6]. The dimension has a composite formation, because it consists of two distinct nosological concepts – anxiety and depression. The formation requires the respondent to provide a single response on the level of severity of these two symptoms. Yet how it is interpreted and answered by respondents is multifactorial and complex. The anxiety and depression terms were chosen because anxiety and depressive disorders are the most prevalent specific psychological conditions departing from the healthy state [7], and commonly are co-morbid conditions in general and clinical populations [8].

Instruments are required to be adequately reliable and valid in order to provide legitimately useful and meaningful results. As the A/D dimension covers psychological symptoms, it is required that the dimension adequately captures anxiety and depression symptoms. Using the Area Under the Receiver Operating Characteristic curve (AUROC), which quantifies the overall ability of a test to discriminate between two groups by integrating sensitivity and specificity into a performance value, one can determine whether the same underlying construct is measured between two instruments [9]. In previous studies that analysed the performance of the A/D dimension as a screening tool for psychiatric conditions, it was found to perform fairly well (AUROC = 0.78–0.86) in screening a community population (90 days post-discharge from hospital), poorly (AUROC = 0.70–0.74) in a hospital population [10], and well in a diabetic adult population (AUROC = 0.88–0.92) [11]. The discriminatory performance of the A/D dimension for detection of anxiety or depression symptoms equalled that of in-depth screening instruments for anxiety and depression.

As the above studies illustrate, the A/D dimension is used in a wide variety of populations given the EQ-5D’s broad applications, meaning it must consistently discriminate across different populations. Thus, it is essential to understand the relationship between responses on the A/D dimension and symptoms of anxiety and depression for informing on the EQ-5D-5L A/D dimension’s validity in different populations, to ensure that comparisons across groups are valid and meaningful. Therefore, the objective is to estimate the discriminatory performance of the composite A/D dimension of the EQ-5D-5L instrument in capturing anxiety and depression symptoms as measured by the Generalised Anxiety Disorder (GAD-7) instrument and the Patient Health Questionnaire (PHQ-9) instrument, respectively. Additionally, we explored the performance of the A/D dimension between sub-populations of a general population sample based on differences by gender, age, education level and chronic conditions.

Methods

Population, data collection and consent

Participant data for this secondary analysis were used from the POPulation health impact of the CORoNavirus disease 2019 (COVID-19) pandemic (POPCORN) study, a longitudinal study that aimed to investigate the broader effects of the COVID-19 pandemic on HRQOL and mental health of the general population in various countries. Participants were enlisted by a market research agency to which written informed consent was provided upon registration to the agency’s voluntary panels. Upon enlisting, the general population participants were aged 18–75. The sample was by design representative for age, sex and education within each country. Data was collected via web-based surveys that were first distributed in early 2020, and then annually until 2023. Once a participant started the survey, the data collection system would not allow skipping or missing questions. A small reward in the form of cash or points was provided by the agency upon completion. The data were anonymised by the agency.

This methodological study is analysing data collected April-May 2020, from six countries: Greece, Italy, The Netherlands, Sweden, the United Kingdom (UK) and the United States (US). For the EQ-5D questionnaires all official EQ-5D-5L translations were used. Where available, official translations for the GAD-7 and PHQ-9 were used; this did not include Swedish. For the remaining questions, the surveys were translated into the respective national language of the country by the agency. The translations were cross-checked by bilingual speakers who had a scientific background. There are no missing values as the survey system does not allow for unanswered or skipped questions.

Measures of anxiety and depression

The last question of the EQ-5D-5L is the A/D dimension. The instrument refers to a period of “today”. Participants rate the level of severity of their problems on a 5-Level (5 L) scale, as either “none”, “slight”, “moderate”, “severe” or “unable to/extreme”, hence level scores range from 1 to 5, respectively [2].

The GAD-7 is a 7-item questionnaire that aims to detect generalised anxiety and other anxiety disorders [12]. The PHQ-9 is a 9-item questionnaire that aims to detect depressive disorders cf. DSM-IV [13]. Both instruments refer to a period over the last two weeks. The GAD-7 includes questions on symptoms of nervousness, worry, irritability, etc., and the PHQ-9 on symptoms of hopelessness, little energy, and more. Participants rate their symptom frequencies between “0 = not at all” to “3 = nearly every day” on a 4-item ordinal scale. Therefore, the total minimum score is 0 and maximum scores are 21 and 27 for the GAD-7 and PHQ-9, respectively. Based on the total score, the severity is categorised as mild, moderate, moderately severe (only for the PHQ-9) or severe, with cut-offs of 5 and above, 10, 15, and 20 (only PHQ-9), respectively. Based on the literature, we used a cut-off score of ≥ 8 [14, 15] from which to differentiate anxiety from no anxiety and of ≥ 10 from which to differentiate depression [13; 16].

As the GAD-7 and PHQ-9 instruments are specific to measuring anxiety and depression, respectively, using a comprehensive list of symptom-related questions, these instruments were used in our study as the “gold standards” to compare the A/D dimension with. In general population samples, the GAD-7 [17] and PHQ-9 [18] were found to have good construct validity and reliability (internal consistency) (Cronbach’s α = 0.89 & 0.87, respectively).

Data analysis

The diagnostic groups that were used to evaluate the performance of the A/D dimension were split into anxiety (score of GAD-7 ≥ 8), depression (PHQ-9 ≥ 10) and co-morbid anxiety and depression (GAD-7 ≥ 8 & PHQ-9 ≥ 10). As the cases in these groups are not exclusive (i.e. some cases occur in more than one group), for the descriptive statistics only, four mutually exclusive groups were created, which we call here the diagnostic sub-groups. These are defined as:

No anxiety (GAD-7 < 8) and no depression (PHQ-9 < 10)
Anxiety present (GAD-7 ≥ 8) and no depression (PHQ-9 < 10)
No anxiety (GAD-7 < 8) and depression present (PHQ-9 ≥ 10)
Anxiety (GAD-7 ≥ 8) and depression present (PHQ-9 ≥ 10)

For analysing the descriptive statistics between the diagnostic sub-groups, the chi-square, Fisher’s exact and ANOVA tests were used to test for statistically significant differences in the number of observations between groups.

In order to examine the performance of the A/D dimension, we used the AUROC analysis. The AUROC is interpreted as the average sensitivity value across all possible specificity values, and, therefore, is a measure of the overall discriminatory performance of a test [19]. A requirement for this analysis is that the outcome variable must be binary (disease present versus absent), which is not possible for the “No anxiety and no depression” sub-group. Therefore, the A/D dimension was compared within their diagnostic group to symptoms of anxiety (versus no anxiety), depression (versus no depression) and both combined (versus having neither anxiety nor depression), as measured by the GAD-7 and PHQ-9. In this way, we compared the performance between instruments, and further compared the performances between sub-populations by splitting the population based on age, gender, education and chronic conditions. Age was numeric and categorised into four age groups: 18–30; 31–45; 46–60; 61–75, because these are equally large intervals of 15 years (with the exception of the youngest age group). The mid-point (age 45) is also the median age in our sample. Gender had three possible outcomes: male, female and other. The highest level of achieved education was categorised according to the International Standard of Classification of Education (ISCED) into low (ISCED 0–2), medium (ISCED 3–4) and high (ISCED 5–8). All or no chronic conditions could be selected from the listed options: Asthma; chronic bronchitis; Severe heart disease; Consequences of a stroke; Diabetes; Chronic rheumatoid arthritis; Severe back complaints/arthrosis of the back; Painful/swollen joints of knee or hip due to arthrosis; Cancer; Memory problems due to a neurological disease/dementia; Memory problems due to ageing; Depression or anxiety disorder, including an open box statement (Other chronic complaints). In this paper also the population with a singular chronic condition (as opposed to none or more than one) was separately analysed, comparing those that selected Depression or anxiety disorder versus those that selected any other chronic condition. The discriminatory performance between diagnostic groups and sub-populations is compared using the AUROC, whereby a larger value is considered to have improved discriminatory performance [9]. The following AUROC value criteria were used: ≤0.5 = useless test; 0.5< - ≤0.7 = poor test; 0.7< - ≤0.8 = moderately accurate test; 0.8< - ≤0.9 = good test; 0.9< - ≤0.99 = excellent test and 1 = perfect test [20]. The AUROC scores were calculated using the parametric method (smoothing) as is recommended for discrete rating scales and large sample sizes [19], as well as their 95% confidence intervals using 2,000 stratified bootstrap repetitions. We additionally conducted sensitivity analyses for all AUROC analyses using the non-parametric method, as the data on the EQ-5D A/D dimensions was not normally distributed [9], as well as using higher thresholds of ≥ 10 and ≥ 15 for the GAD-7 and PHQ-9, respectively. Statistically significant comparisons were made examining solely the AUROC confidence intervals.

To support the performance analysis, we calculated the sensitivity, specificity, Positive Predictive Value (PPV)/Precision, Negative Predictive Value (NPV) and accuracy using different thresholds of the A/D dimension with the diagnostic groups in the total population and by sub-population. To determine the optimal cut-off points for the A/D dimension in the total sample and by sub-population, we calculated the Youden index from the sensitivity and specificity scores [21]. Statistical analyses were carried out using IBM SPSS version 28.0.1.0. For the AUROC analyses we used R Studio Version 4.2.1 and the pROC open-source package [22]. Figures were created using Microsoft Excel.

Results

Respondent characteristics and their mental health

In total, 19,902 participants were included in the study. The median age was 45 (interquartile range: 26) and most participants were highly educated (52.2%) (Table 1). Overall, 46.8% of our sample had one or more chronic conditions. Depression/anxiety disorder was a listed chronic condition, of which 2,699 (13.6%) participants from the total sample selected this condition (Table S1). Of those participants that have a singular chronic condition (n = 5,892), 1,248 (21.2%) have an anxiety or depression disorder (Table 1). More information on participants with chronic conditions is in the appendix (Tables S1-S2).

Table 1 Respondent characteristics and descriptive data on mental health for the total sample and by diagnostic sub-groups (N = 19,902)

Full size table

On the A/D dimension, half the participants (50.5%) had slight to extreme problems (referred to as “any problems” from here on) (Table 1). Anxiety symptoms (GAD-7 ≥ 8) occurred in 4,724 (23.7%) participants, and depression symptoms (PHQ-9 ≥ 10) in 4,221 (21.2%) participants. When taking the total sample apart into mutually exclusive groups (diagnostic sub-groups), then 14,320 (72%) had no anxiety nor depression symptoms, 1,361 (6.8%) had anxiety symptoms only, 858 (4.3%) had depression symptoms only, and 3,363 (16.9%) had both anxiety and depression symptoms. By age group, any problems on the A/D dimension steadily increased from 34.1 to 63.2% with decreasing age (Fig. 1a). Similarly for symptoms of anxiety and depression, rates gradually increased from 9.1 to 37.7% and from 7.3 to 36.7%, respectively, with decreasing age. Females and other had a higher prevalence of symptoms than males (Fig. 1b), middle and highly educated had a slightly higher prevalence than low educated (Fig. 1c), those with one or more chronic conditions had around 2.5 times higher symptom prevalence on the GAD-7 and PHQ-9 than those with no chronic conditions (Fig. 1d), and those with a single chronic condition of anxiety/depression had around two times higher symptom prevalence than those with any other single chronic condition (Fig. 1e). Table 1 further illustrates the breakdown of mental health outcomes by diagnostic sub-group for each age group and problems on the A/D dimension, and provides the inclusion criteria for the diagnostic groups on the GAD-7 and PHQ-9 scales. The differences in frequencies between the diagnostic sub-groups differed significantly by sub-populations (age group, gender, educational level, chronic conditions and singular chronic condition) (p ≤ .005) (Table 1).

Discriminatory performance

As a preliminary analysis to the AUROC (Fig. 2a and b), we see that the levels of problems on the A/D dimension against the prevalence of severity on both the GAD-7 and PHQ-9 shows a steady gradient in extremities. For example, severe anxiety symptoms on the GAD-7 range from 1.1% in “no problems” on the A/D dimension to 64.7% in “extreme problems” in the total population (Fig. 2a). To support the AUROC performance analysis, the diagnostic group frequencies per sub-population are presented in Table 2. The A/D dimension performance for the total sample against the GAD-7 and PHQ-9 ranged in AUROC between 0.853 and 0.859, and did not differ significantly between diagnostic groups (Table 3). Likewise did the performance not differ significantly between the diagnostic groups within each sub-population. This was confirmed again in the non-parametric (Table S5) and higher threshold AUROC analyses (Table S6).

Table 2 Frequencies and proportions of diagnostic groups for the total sample and by sub-population (age group, gender, education and chronic conditions) (N = 19,902)

Full size table

Table 3 Discriminatory performance of the EQ-5D-5L A/D dimension compared to the diagnostic groups, for the total sample and by sub-populations (gender, age group, education and chronic conditions), using AUROC analysis

Full size table

Across the age groups, the performance differed significantly between ages 18–30 and 31–45 versus 46–60 and 61–75 in all diagnostic groups, with AUROC ≤ 0.823 in the two younger groups and AUROC ≥ 0.874 in the two older groups (Table 3). Moreover, the AUROC was consistently ≤ 0.804 in the under-30-year-olds and ≥ 0.910 in the over-61-year-olds. The performance between groups within gender, education and chronic conditions (none vs. one or more) did not differ significantly, apart from marginally worse performance in the highly educated group compared to the low and middle educated in two of the three diagnostic groups. However, the performance did differ significantly between those with one chronic condition that is an anxiety/depression disorder and those with any other singular chronic condition. The AUROC ranged between 0.726 and 0.750 in the former group and thus presents the lowest performance among all groups, whereas the AUROC ranged between 0.832 and 0.838 for the latter group.

Given that the performance was found to differ significantly by age group and singular chronic conditions, we sought to examine whether the differences persisted by further splitting the performance analyses (Table S4). The age differences in performance largely persisted when further split by education level, most noticeably in the high education group. The stepwise differences between age groups only persisted when further split by singular chronic conditions in those with a chronic condition other than anxiety or depression. Whereas the differences in AUROC between the singular chronic conditions groups persisted more strongly in the higher age groups when comparing equals (same age groups). Furthermore, the lower versus higher performance in those having an anxiety or depression disorder versus any other persisted in the middle and high education groups only, not the low.

Using the non-parametric performance analysis yielded overall lower AUROC values, but the differences and their conclusions remained (Table S5). Similarly, the sensitivity analysis using higher thresholds for the diagnostic groups (GAD-7 ≥ 10 and PHQ-9 ≥ 15) yielded higher AUROC values, but the differences and their conclusions largely remained the same, except that the differences between ages were now more pronounced when split by education and by singular chronic condition than they were with the lower thresholds (Table S6).

Optimal A/D dimension cut-off

Supporting frequency analyses found that the percentage of non-corresponding results on problems on the A/D dimension versus anxiety or depression symptoms was overall higher in the two younger age groups than the two older when using a cut-off on the A/D dimension of ≥ 2 (Tables S7A-C) and ≥ 3 (Tables S8A-C). The same applied in the prevalence of non-corresponding results between the singular chronic conditions groups, where this was higher for both a cut-off of ≥ 2 and ≥ 3 on the A/D dimension in the anxiety/depression group compared to the any other group (Tables S7A-C & S8A-C). The supporting data on sensitivity, specificity, PPV, NPV, accuracy and the Youden’s index show that the highest sensitivity of the A/D dimension in each of the diagnostic groups for the total sample is reached with a cut-off score of ≥ 3 for having A/D – at the cost of lower specificity, accuracy and PPV, but not NPV (Tables S9-S12). Among the two younger age groups, the better score is reached with a cut-off point on the A/D dimension of ≥ 3, as this is the highest Youden’s index of between 0.42 and 0.49 (Table S10). Contrarily, among the two older age groups, the better cut-off point is ≥ 2 (highest Youden’s index: 0.52–0.65) (Table S9). Among those with a single chronic condition that is anxiety or depression, the Youden Index of 0.32–0.37 indicates a cut-off of ≥ 4 to be the more adequate for this group (Table S11). And finally, for those with a single chronic condition other than anxiety or depression, the ideal cut-off point is between ≥ 2 or ≥ 3, as the Youden Index ranges between 0.44 and 0.47 (Tables S9-S10).

Discussion

Our results showed that in the total sample performance analysis, the A/D dimension demonstrated good performance (AUROC > 0.85) against the GAD-7 and PHQ-9 instruments. The performance did not differ significantly between diagnostic groups, meaning the A/D dimension was not better at capturing either anxiety or depression. This statement held true when splitting the performance analysis by age group, gender, education and chronic conditions and in the sensitivity analyses. When analysing the performance by age group, significant differences were observed between those aged 18–45 and those aged 46–75, with poorer performance in the younger group. In the performance by singular chronic condition, those with only an anxiety or depression condition had significantly worse performance than those with any other singular chronic condition.

Performance between diagnostic groups

This performance analysis study is one of two studies conducted specifically on the 5-level version of the EQ-5D’s A/D dimension in comparison to anxiety- and depression-specific screening tools [10; 11]. In one study involving participants following hospital discharge (named community setting), the performance of the 3-level version of the EQ-5D A/D dimension was evaluated against the GAD-2, PHQ-9 and both combined [10]. In comparison to this study, our AUROC results for the total sample are slightly improved, which could be a result of the improved discriminatory power of the 5-level version over the 3-level version [23]. Considering this, the AUROC values are generally comparable to those of the community setting results (AUROC: 0.78–0.86), and as in our study, did not differ significantly between instruments. In a further performance analysis study comparing the A/D dimension to the GAD-2 and PHQ-8 and both combined, slightly higher performance was detected, with improved AUROC scores in the good and excellent range [11]. This is most likely due to their study population being older on average compared to ours, which also supports our findings on the differences between the age groups (see following section). Again, the performance did not differ significantly by instrument [11]. Together with our study, these studies show that the A/D dimension is equally sensitive in picking up anxiety, depression and both anxiety and depression symptoms, exhibiting high convergent validity. Further, it is similarly sensitive across different population samples. This was not surprising, as the A/D dimension has been shown to be interpreted as taking both anxiety and depression symptoms into account in individuals self-reporting their health, compared to the Pain/Discomfort dimension where only Pain was mainly used to report on [24, 25].

Age group performance

We observed that the A/D dimension performance was significantly improved in the older population. Moderate to good performance prevailed in the younger age groups whereas good to excellent performance was observed in the older age groups. Significant differences persisted when we additionally split the age group performance by education and chronic conditions.

These differences in performance could indicate differential item functioning (DIF) of the A/D dimension between generations. It may occur that an item of interest does not measure a construct equivalently across different groups, leading to DIF. For example, it was found that the A/D dimension exhibited age-related DIF, between older (aged 65 + years) and younger adults (aged 18–64), whereby older adults were less likely to report problems [26]. This was also found to be the case in this study, with the frequency of reporting any problems on the A/D dimension, as well as symptoms on the GAD-7 and PHQ-9, being comparatively lower in the older population compared to the younger.

Singular chronic condition performance

We also observed significant differences in the performance of the A/D dimension between participants that have a singular chronic condition that is an anxiety or depression disorder and those that have a different singular chronic condition other than anxiety/depression. The former group had significantly worse overall performance compared to the latter group, but also compared to the other performance data. The performance values are classified as moderately accurate only, compared to good to excellent in the remaining population groups of this study. This was an unexpected finding – we would have expected the discriminatory performance to be highest in those with a (though self-reported) diagnosis of anxiety/depression disorder, exhibiting known-group validity. Studies have validated the EQ-5D instrument in diagnosed anxiety disorder or major depressive disorder (MDD) populations [27, 28], of which two found specifically the A/D dimension to be strongly correlated with other disease-specific instruments, including the GAD-7 and PHQ-9 [29, 30]. However, in the study by Supina et al., where reporting problems on the A/D dimension was compared between participants with an anxiety only diagnosis, major depressive episode only or both in a logistic regression, they concluded that there was a need for the A/D dimension to better distinguish between persons with a single anxiety or depression disorder [31]. In that study, the A/D dimension more strongly distinguished those with co-morbid anxiety and depression. However this is not reflected in the current study nor by Short et al. [10].

Given the high prevalence of 94% on any problems on the A/D dimension in the singular chronic anxiety/depression disorder group and the therewith lower prevalence of anxiety and depression symptoms on the GAD-7 (≥ 8) and PHQ-9 (≥ 10) instruments, respectively, we theorise that the lower performance has to do with medicinal treatment of symptoms in this group. We suggest that in this group the responses on the GAD-7 and PHQ-9 were lower because they were suffering less frequently from symptoms of their conditions, as they were possibly being treated with medication. Whereas responses on the A/D dimension remained high because they indeed have the condition today, but this was not considered to be related to anxiety/depression symptoms, but rather simply having the condition. Since it was a self-reported survey, no explanation of the EQ-5D instrument was provided, thus it is possible that this dimension is interpreted differently in a participant with an anxiety or depression disorder. As the evidence for this theory is limited, the interpretation in this specific group could be important to investigate.

As far as we could find, there were no studies analysing the A/D responses using a population that self-reported a chronic anxiety or depression disorder. Rather, the studies involved were either a professionally diagnosed anxiety disorder or MDD population, or a self-reporting healthy/diseased general population sample (however when diseased, not specifically having an anxiety/depression disorder).

Strengths and limitations

This study is the largest to date to analyse the performance of the EQ-5D-5L A/D dimension, with almost 20,000 general population adults included across the majority of the lifespan. It is also the first to investigate the discriminatory performance between age groups and self-reported chronic conditions. Having said that, the EQ-5D-5L is strictly employed as a complete instrument, which was not taken into account here, as we have relied solely on the A/D dimension to capture anxiety and depression symptoms. This does not provide a complete and accurate picture of the state of anxiety and depression in the individual. All dimensions of the EQ-5D to some extent may capture symptoms of anxiety and/or depression [30; 31]. On the other hand, the A/D dimension is the primary dimension of the EQ-5D descriptive system that captures psychological symptoms [6], and as such is the most relevant in terms of representing an individual’s anxiety/depression state. This statement also reflects the fact that a person’s response heuristics to the EQ-5D questionnaire (and most other questionnaires, for that matter) is not a structured categorising of symptoms, like in Boolean logic. In the example of the A/D dimension, the Boolean logic would prescribe a ‘computational’ weighing and adding of symptoms due to the dimensions composite nature. However, we would like to stress that a person’s response does not function as the Boolean logic describes. It is more complex, multi-faceted and simply human than that.

In this analysis we could not truly determine whether differences by age and chronic conditions are indeed due to participants’ age or their conditions, or another unmeasured cohort effect. Having said that, chronic conditions are more prevalent among older adults, and we did not see significant differences in the performance between those with no chronic conditions and those with. Yet the older age groups showed improved performance. Moreover, differences persisted in all sensitivity analyses, including those split by education, and education reflects to some extent both living standards in early life and the cultural component of one’s socio-economic status [32]. Nonetheless, to validate these findings, we would advise employing an analysis strategy that can account for influencing variables that are likely to impact age and poor health, such as more nuanced living standards (e.g. GDP per capita or income) [33].

Furthermore, the performance of the A/D dimension is compared to the “gold standard” GAD-7 and PHQ-9, which are anxiety- and depression-specific and complete screening instruments. However, these two instruments are imperfect estimations, as the data are ultimately self-rated and not professional diagnoses established with a diagnostic interview. In a meta-analysis on the accuracy of the PHQ-9, it was found that the instrument may be more specific among older patients (aged 60 or over) [16]. Thus, to some extent, the differences in age may be due to the PHQ-9; however this is unlikely to explain the complete picture of differences, particularly not those in the GAD-7 diagnostic groups. This is generally the bias one encounters when investigating accuracy of self-rated questionnaires using ROC analysis, because one measure has to be regarded as the “gold standard” [34]. Having said that, these instruments have frequently proven to be accurate screening tools for the detection of generalised anxiety disorder and other anxiety disorders [35], and depression disorders cf. DSM-IV [16, 36, 37]. Our comparison is inherently imperfect as the time scales and structures are different: problems experienced “Today” on the A/D dimension compared to frequency of symptoms across “two weeks” on the GAD-7 and PHQ-9 instruments. However, in order to achieve our aim we needed to compare the EQ-5D A/D dimension with a disease-specific instrument. As the instruments’ intentions are thus different, these structural differences were unavoidable. This is likely to have affected the performance values overall, but unlikely to have affected the comparisons we make across groups.

Conclusions

In our performance analysis of the EQ-5D-5L A/D dimension to understand the relationship between responses on the dimension and symptoms on the specific screening tools, we found that the performance was similar between diagnostic groups, thus was equally sensitive in capturing symptoms of anxiety, depression and both. Performance was worse in the younger population, possibly due to age-related differential item functioning of the A/D dimension. Performance was also worse in those having indicated anxiety/depression disorder as a chronic condition, possibly due to the lack of a description of symptoms in the A/D dimension – interpretation in this group should be further investigated. This study marks the first to analyse performance differences of the A/D dimension between groups of a general population. We present evidence that the performance of the A/D dimension may differ between generations, and thus intra-age comparative data using the EQ-5D may be flawed. We recommend further exploring these differences, given the concerning trend in mental health problems among the young population and overall.

Data availability

The original contributions presented in the study are included in the article/Supplementary Materials, further inquiries can be directed to the corresponding author/s.

References

Lapin, B. R. (2020). Considerations for reporting and reviewing studies including Health-Related Quality of Life. Chest, 158(1, Supplement), S49–S56.
Article PubMed Google Scholar
Brooks, R. (1996). EuroQol: The current state of play. Health Policy, 37(1), 53–72.
Article CAS PubMed Google Scholar
Feng, Y. S., Kohlmann, T., Janssen, M. F., & Buchholz, I. (2021). Psychometric properties of the EQ-5D-5L: A systematic review of the literature. Quality of Life Research, 30(3), 647–673.
Article PubMed Google Scholar
Rabin, R., & Charro, F. (2001). EQ-5D: A measure of health status from the EuroQol Group. Annals of Medicine, 33(5), 337–343.
Article CAS PubMed Google Scholar
Guyatt, G. H., Feeny, D. H., & Patrick, D. L. (1993). Measuring health-related quality of life. Annals of Internal Medicine, 118(8), 622–629.
Article CAS PubMed Google Scholar
Finch, A. P., & Mulhern, B. (2022). Where do measures of health, social care and wellbeing fit within a wider measurement framework? Implications for the measurement of quality of life and the identification of bolt-ons. Social Science and Medicine, 313, 115370.
Article PubMed Google Scholar
Institute of Health Metrics and Evaluation. (2019). Global burden of Disease (GBD) results. Global Health Data Exchange (GHDx).
Belzer, K., & Schneier, F. R. (2004). Comorbidity of anxiety and depressive disorders: Issues in conceptualization, Assessment, and treatment. Journal of Psychiatric Practice^®, 10(5), 296–306.
Article PubMed Google Scholar
Nahm, F. S. (2022). Receiver operating characteristic curve: Overview and practical use for clinicians. Korean J Anesthesiol, 75(1), 25–36.
Article PubMed PubMed Central Google Scholar
Short, H., Al Sayah, F., Ohinmaa, A., & Johnson, J. A. (2021). The performance of the EQ-5D-3L in screening for anxiety and depressive symptoms in hospital and community settings. Health and Quality of Life Outcomes, 19(1), 96.
Article PubMed PubMed Central Google Scholar
Al Sayah, F., Ohinmaa, A., & Johnson, J. A. (2018). Screening for anxiety and depressive symptoms in type 2 diabetes using patient-reported outcome measures: Comparative performance of the EQ-5D-5L and SF-12v2. MDM Policy & Practice, 3(2), 2381468318799361.
Article Google Scholar
Spitzer, R. L., Kroenke, K., Williams, J. B. W., & Löwe, B. (2006). A brief measure for assessing generalized anxiety disorder: The GAD-7. Archives of Internal Medicine, 166(10), 1092–1097.
Article PubMed Google Scholar
Kroenke, K., Spitzer, R. L., & Williams, J. B. (2001). The PHQ-9: Validity of a brief depression severity measure. Journal of General Internal Medicine, 16(9), 606–613.
Article CAS PubMed PubMed Central Google Scholar
Johnson, S. U., Ulvenes, P. G., Øktedalen, T., & Hoffart, A. (2019). Psychometric properties of the General anxiety disorder 7-Item (GAD-7) scale in a Heterogeneous Psychiatric Sample. Frontiers in Psychology, 10.
Kroenke, K., Spitzer, R. L., Williams, J. B., Monahan, P. O., & Löwe, B. (2007). Anxiety disorders in primary care: Prevalence, impairment, comorbidity, and detection. Annals of Internal Medicine, 146(5), 317–325.
Article PubMed Google Scholar
Levis, B., Benedetti, A., & Thombs, B. D. (2019). Accuracy of Patient Health Questionnaire-9 (PHQ-9) for screening to detect major depression: Individual participant data meta-analysis. Bmj, 365, l1476.
Article PubMed PubMed Central Google Scholar
Löwe, B., Decker, O., Müller, S., Brähler, E., Schellberg, D., Herzog, W., & Herzberg, P. Y. (2008). Validation and standardization of the generalized anxiety disorder screener (GAD-7) in the General Population. Medical Care, 46(3).
Kocalevent, R. D., Hinz, A., & Brähler, E. (2013). Standardization of the depression screener Patient Health Questionnaire (PHQ-9) in the general population. General Hospital Psychiatry, 35(5), 551–555.
Article PubMed Google Scholar
Park, S. H., Goo, J. M., & Jo, C. H. (2004). Receiver operating characteristic (ROC) curve: Practical review for radiologists. Korean Journal of Radiology, 5(1), 11–18.
Article PubMed PubMed Central Google Scholar
Hanley, J. A., & McNeil, B. J. (1982). The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology, 143(1), 29–36.
Article CAS PubMed Google Scholar
Youden, W. J. (1950). Index for rating diagnostic tests. Cancer, 3(1), 32–35.
Article CAS PubMed Google Scholar
Robin, X., Turck, N., Hainard, A., Tiberti, N., Lisacek, F., Sanchez, J. C., & Müller, M. (2011). pROC: An open-source package for R and S + to analyze and compare ROC curves. Bmc Bioinformatics, 12(1), 77.
Article PubMed PubMed Central Google Scholar
Janssen, M. F., Pickard, A. S., Golicki, D., Gudex, C., Niewada, M., Scalone, L., Swinburn, P., & Busschbach, J. (2013). Measurement properties of the EQ-5D-5L compared to the EQ-5D-3L across eight patient groups: A multi-country study. Quality of Life Research, 22(7), 1717–1727.
Article CAS PubMed Google Scholar
McDonald, R., Mullett, T. L., & Tsuchiya, A. (2020). Understanding the composite dimensions of the EQ-5D: An experimental approach. Social Science & Medicine, 265, 113323.
Article Google Scholar
Engel, L., Whitehurst, D. G. T., Haagsma, J., Janssen, M. F., & Mulhern, B. (2023). What is measured by the composite, single-item pain/discomfort dimension of the EQ-5D-5L? An exploratory analysis. Quality of Life Research, 32(4), 1175–1186.
Article PubMed Google Scholar
Penton, H., Dayson, C., Hulme, C., & Young, T. (2022). An investigation of Age-Related Differential Item Functioning in the EQ-5D-5L using item response theory and logistic regression. Value in Health, 25(9), 1566–1574.
Article PubMed Google Scholar
Sapin, C., Fantino, B., Nowicki, M. L., & Kind, P. (2004). Usefulness of EQ-5D in assessing Health Status in Primary Care patients with major depressive disorder. Health and Quality of Life Outcomes, 2(1), 20.
Article PubMed PubMed Central Google Scholar
Bilbao, A., Martín-Fernández, J., García-Pérez, L., Mendezona, J. I., Arrasate, M., Candela, R., Acosta, F. J., Estebanez, S., & Retolaza, A. (2022). Psychometric properties of the EQ-5D-5L in patients with major depression: Factor analysis and Rasch analysis. Journal of Mental Health (Abingdon, England), 31(4), 506–516.
Article PubMed Google Scholar
König, H. H., Born, A., Günther, O., Matschinger, H., Heinrich, S., Riedel-Heller, S. G., Angermeyer, M. C., & Roick, C. (2010). Validity and responsiveness of the EQ-5D in assessing and valuing health status in patients with anxiety disorders. Health and Quality of Life Outcomes, 8(1), 47.
Article PubMed PubMed Central Google Scholar
Belay, Y. B., Mihalopoulos, C., Lee, Y. Y., Mulhern, B., & Engel, L. (2023). Examining the psychometric properties of a split version of the EQ-5D-5L anxiety/depression dimension in patients with anxiety and/or depression. Quality of Life Research, 32(7), 2025–2036.
Article PubMed PubMed Central Google Scholar
Supina, A. L., Johnson, J. A., Patten, S. B., Williams, J. V. A., & Maxwell, C. J. (2007). The usefulness of the EQ-5D in differentiating among persons with major depressive episode and anxiety. Quality of Life Research, 16, 749–754.
Article PubMed Google Scholar
Szende, A., & Janssen, B. (2014). Chapter 5: Socio-demographic indicators based on EQ-5D. In A. Szende, B. Janssen, & J. Cabases (Eds.), Self-reported Population Health: An International Perspective based on EQ-5D. Springer.
Szende, A., & Janssen, B. (2014). Chapter 4: Cross-country analysis of EQ-5D data. In A. Szende, B. Janssen, & J. Cabases (Eds.), Self-reported Population Health: An International Perspective based on EQ-5D. Springer.
Swets, J. A. (1988). Measuring the Accuracy of Diagnostic systems. Science, 240(4857), 1285–1293.
Article CAS PubMed Google Scholar
Plummer, F., Manea, L., Trepel, D., & McMillan, D. (2016). Screening for anxiety disorders with the GAD-7 and GAD-2: A systematic review and diagnostic metaanalysis. General Hospital Psychiatry, 39, 24–31.
Article PubMed Google Scholar
Pettersson, A., Boström, K. B., Gustavsson, P., & Ekselius, L. (2015). Which instruments to support diagnosis of depression have sufficient accuracy? A systematic review. Nordic Journal of Psychiatry, 69(7), 497–508.
Article PubMed Google Scholar
Moriarty, A. S., Gilbody, S., McMillan, D., & Manea, L. (2015). Screening and case finding for major depressive disorder using the Patient Health Questionnaire (PHQ-9): A meta-analysis. General Hospital Psychiatry, 37(6), 567–576.
Article PubMed Google Scholar

Download references

Acknowledgements

We would like to thank Carolien Maas, MSc, for her valuable support on the data analysis.

Funding

This study was supported by the EuroQol Research Foundation (grant number: EQ Project 243-RA).

Author information

Authors and Affiliations

Department of Public Health, Erasmus MC, Rotterdam, The Netherlands
Emily Stella Scott & Juanita A. Haagsma
Department of Community Health and Social Medicine, CUNY School of Medicine, New York City, NY, USA
Erica I. Lubetkin
Section Medical Psychology and Psychotherapy, Department of Psychiatry, Erasmus MC, Rotterdam, The Netherlands
Mathieu F. Janssen
Health Department of Economics, National and Kapodistrian University of Athens, Athens, Greece
John N. Yfantopolous
EuroQol Research Foundation, Rotterdam, The Netherlands
Gouke J. Bonsel

Authors

Emily Stella Scott
View author publications
You can also search for this author in PubMed Google Scholar
Erica I. Lubetkin
View author publications
You can also search for this author in PubMed Google Scholar
Mathieu F. Janssen
View author publications
You can also search for this author in PubMed Google Scholar
John N. Yfantopolous
View author publications
You can also search for this author in PubMed Google Scholar
Gouke J. Bonsel
View author publications
You can also search for this author in PubMed Google Scholar
Juanita A. Haagsma
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the conception and design of the study. JAH, GJB, and MFJ designed the questionnaire, collected the data and developed the analytical design. Material preparation, analysis, and interpretation of data were performed by ESS and JAH. ESS wrote the first draft of the manuscript. All authors reviewed and critically revised the manuscript. All authors contributed to the article and approved the submitted version.

Corresponding author

Correspondence to Emily Stella Scott.

Ethics declarations

Ethical approval

This study was performed in line with the principles of the Declaration of Helsinki. Ethical approval was obtained from the Erasmus MC ethics review board (approval MEC-2020-0266). Data were collected anonymously.

Competing interests

All authors except for ESS are members of the EuroQol group. The authors have no relevant financial or non-financial interests to disclose. Views expressed by the authors in the publication do not necessarily reflect those of the EuroQol Group.

Consent to participate

Informed consent was obtained from all individual participants included in the study.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Scott, E.S., Lubetkin, E.I., Janssen, M.F. et al. The performance relationship between the EQ-5D-5L composite “Anxiety/Depression” dimension and anxiety and depression symptoms in a large, general population sample. Qual Life Res (2024). https://doi.org/10.1007/s11136-024-03754-5

Download citation

Accepted: 27 July 2024
Published: 13 September 2024
DOI: https://doi.org/10.1007/s11136-024-03754-5

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

The performance relationship between the EQ-5D-5L composite “Anxiety/Depression” dimension and anxiety and depression symptoms in a large, general population sample

Abstract

Purpose

Methods

Results

Conclusion

Similar content being viewed by others

The performance of the EQ-5D-3L in screening for anxiety and depressive symptoms in hospital and community settings

New cut-off points of PHQ-9 and its variants, in Costa Rica: a nationwide observational study

The 14-item short health anxiety inventory (SHAI-14) used as a screening tool: appropriate interpretation and diagnostic accuracy of the Swedish version

Introduction

Methods

Population, data collection and consent

Measures of anxiety and depression

Data analysis

Results

Respondent characteristics and their mental health

Discriminatory performance

Optimal A/D dimension cut-off

Discussion

Performance between diagnostic groups

Age group performance

Singular chronic condition performance

Strengths and limitations

Conclusions

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethical approval

Competing interests

Consent to participate

Additional information

Publisher’s Note

Electronic supplementary material

Supplementary Material 1

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation