Abstract
Purpose
To determine feasibility and validity of the EQ-5D-3L in the elderly European population.
Methods
Secondary data analysis based on the study of health, ageing, and retirement in Europe (SHARE) to determine the percentage of missing items for EQ-5D dimensions and EQ VAS, and to demonstrate convergent/divergent validity with measures included in the SHARE survey. Known-groups validity was tested using literature-based hypotheses. Correlation coefficients and Cohen’s f are reported.
Results
Missing values were below 3% across all EQ-5D dimensions and gender strata, slightly increasing with age. Individuals’ responses to each EQ-5D dimension were related to their ratings of other measures in expected directions. The EQ VAS and all EQ-5D dimensions (except anxiety/depression) moderately to strongly correlated with physical [e.g. number of limitations in activities of daily living (ADL): r = 0.313–0.658] and generic measures [CASP (control, autonomy, self-realization, pleasure)-19 scale, self-perceived health, number of symptoms: r = 0.318–0.622], while anxiety/depression strongly correlated with the EURO-D scale (r = 0.527). Both EQ-5D dimensions and EQ VAS discriminated well between two [or more] groups known to differ [e.g. anxiety/depression discriminated well between persons classified as depressed/not depressed using the EURO-D scale, f = 0.51; self-care differentiated best between individuals without and with 1 + ADL limitations, f = 0.69]. Sociodemographic variables like gender, education, and partner in household were hardly associated with EQ VAS scores (f < 0.25).
Conclusion
With item non-response of less than 3%, good discriminatory, and construct properties, the EQ-5D-3L showed to be a feasible and valid measure in the elderly Europeans.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Background
A long life can bring opportunities for the persons themselves, their relatives, and the society as a whole, but the extent to which a person can make use of additional lifetime essentially depends on at least one factor: health [1].
We can measure, describe, and compare health and its various manifestations—for one person over time, between persons, patient groups, and populations. The individual’s own perception of health is usually captured using patient-reported outcome measures. The EQ-5D belongs to this group of instruments.
Introduced in 1990 by the EuroQol Group and available in more than 170 languages, the EQ-5D-3L (originally “EQ-5D”) is one of the most widely used preference-accompanied measures [2]. It consists of two components—a descriptive system formed by five items each capturing one of five dimensions (mobility, self-care, usual activities, pain/discomfort, anxiety/depression) via three possible response options expressed as the level of problems (level 1: no problems, level 2: some or moderate problems, level 3: extreme problems, unable to or confined to bed), and a vertical thermometer scale ranging from 0 (“worst imaginable health state”) to 100 (“best imaginable health state”), called EQ VAS. The instrument has an important role in describing and understanding population health within and across countries [3,4,5,6]. Although there is a large body of literature on EQ-5D in the general adult population [7, 8], knowledge about the instrument’s measurement properties in the ageing population is scarce [6, 9, 10].
In the face of demographic change, the older and so-called “oldest-old” represent the broadest and fastest growing population groups. This development is no longer limited to the industrialized world and accompanied by a parallel epidemiological trend: the increase in chronic and degenerative diseases [11]. The WHO even speaks of “a dramatically increasing pace of population ageing around the world,” with lower- and mid-income countries now facing the greatest changes with respect to the shift in the ageing pyramid [1].
However, although additional lifetime can be spent in good health, it is evident that physical health problems occur more frequently in the elderly than in the overall adult general population and are increasing with age [12, 13]. A projection to 2060 by Sleeman et al. shows that the burden of serious health-related suffering will almost double by 2060, with the fastest increases among older people, and those with dementia [14].
Against this background, this contribution aims to extend our knowledge of the EQ-5D in the ageing population by investigating its feasibility and validity across 15 European countries.
Methods
This secondary data analysis is based on the 4th Wave data from the Survey of Health, Ageing and Retirement in Europe (SHARE) [15,16,17,18], a representative survey covering the elderly population (50+) in 15 European countries: Austria, Belgium, the Czech Republic, Denmark, France, Germany, Hungary, Italy, the Netherlands, Poland, Portugal, Slovenia, Spain, Sweden, and Switzerland. Data collection took place in 2011 (Germany/Poland: 2011/2012) and was based on computer-assisted personal interviewing (CAPI). Data of the three-level version of the EQ-5D were collected using a standardized short self-completion paper-and-pencil questionnaire (so-called “drop-off” questionnaire) [19, 20]. The questionnaire was always self-completed by the individual; proxy responses were not allowed. It begins with the five items of the descriptive system of the EQ-5D, followed by the EQ VAS, questions about payment for various types of care, out-of-pocket expenses, and four items about loneliness. For some countries, country-specific questions follow. The questionnaire could be given back to the interviewer or posted back in a provided envelope. There is no information on whether assistance is provided when completing the questionnaire in the presence of the interviewee. It should be noted that the instructions of the EQ-5D differ from the original.
Data analysis was done using STATA/SE 16.1 using all available EQ-5D data (no listwise deletion) of panellists aged 50 years and older. For analysis, we built age groups in five-year increments. A description of the data resource and the methodology can be found elsewhere [15, 17].
Variables
For proving both construct validity and known-groups validity, a selection of variables and measures included in the SHARE survey was used depending on the type of validity and the underlying construct. In the following, we are giving some information on the operationalization and definition of those variables. Detailed information (inclusive references) can be found elsewhere [21].
Sample characterization is based on the following variables from SHARE Wave 4: age, gender, country, education [years in school and International Standard Classification of Education (ISCED)], marital status, mean EQ VAS, current job situation, self-perceived health, number of symptoms for at least six months, number of limitations in activities of daily living (ADL), body mass index (BMI), and current smoking behaviour.
Physical measures
The number of limitations in six everyday self-care activities such as walking, eating, and toileting is expressed in a 0–6 ranging ADL index with higher values representing more limitation in activities which are fundamental for maintaining independence [22].
The IADL index describes the number of limitations with seven instrumental activities of daily living like taking medications, making telephone calls, and managing money. The score ranges from 0 to 7 with higher values expressing more limitations in IADL [22].
Long-standing activity limitations (6 month or more) are measured with a global single-item indictor which was developed for comparing health expectancy and disability across Europe. This dichotomous (0 = limited, 1 = not limited) global activities limitation index (GALI) refers to general health problems and activities people usually do [23].
Physical inactivity is operationalized as a dichotomous measure that comprises the answers of two items “We would like to know about the type and amount of physical activity you do in your daily life. How often do you engage in vigorous physical activity, such as sports, heavy housework, or a job that involves physical labour?”, and “How often do you engage in activities that require a moderate level of energy such as gardening, cleaning the car, or doing a walk?” with “1” representing “never vigorous nor moderate physical activity”.
Mental measures
The EURO-D scale used in SHARE comprises 12 items (e.g. depression, guilt, sleep, interest, appetite) that present common symptoms of late-life depression. The composition index ranges from 0 (not depressed) to 12 (very depressed) with values of 4 or higher indicating a case of depression (cut-off) [24].
Anxiety is operationalized as a 5-item indicator. Items such as “fear of the worst happening” or “fear of dying” stem from the Beck Anxiety Inventory and have four response options (1 = “never”, 2 = “hardly ever”, 3 = “some of the time”, 4 = “most of the time”). Higher values indicate higher levels of anxiety.
Cognitive measures
The 10-words-recall test is used to measure cognitive impairments and dementia. Two scores are provided: the number of words the respondent is able to immediately recall after listening to a list of 10 words and the number of words the respondent is able to recall after a delay time (delayed recall) [25].
Another indicator of cognitive impairments, executive functions, is operationalized by a one item verbal fluency test. Respondents have 60 s to name as many different animals as they can think of. The score ranges from 0 to 100 and expresses the number of animals named within one minute. Animals were chosen because they represent a clear and popular semantic category across languages and cultures.
Memory performance is measured using a single item (“How would you rate your memory at the present time? Would you say it is excellent, very good, good, fair or poor?”) with a score ranging from 1 (“excellent”) to 5 (“poor”).
The mathematical performance is measured by five items that test subtraction calculation skills, e.g. “Now let's try some subtraction of numbers. One hundred minus 7 equals what?”, “And 7 from that?”. The score ranges from 1 (“bad”) to 5 (good).
Temporal orientation is operationalized as the respondent’s orientation to date, month, year, and day of week measured by four items (e.g. “Which month is it?”). The score ranges from 0 to 4 with higher values representing better orientation.
General health and other measures
The BMI was calculated according to the WHO definition of 1995 and expresses the weight (in kg) divided by the square of the height (metres2).
Self-perceived health status was assessed using two measures: a single item of self-perceived health with response categories based on the SF-36 (ranging between “excellent” and “poor”) and a 12-item revised version of the CASP-19 scale (the name is an acronym of the four subscales: control, autonomy, self-realization, pleasure). Since for each item, there are 4 response options, the CASP index ranges from 12 (low quality of life) to 48 (high quality of life) [26, 27].
A maximum value of the grip strength measurements of both hands was generated for respondents with two valid measures for each hand and if the two measures for one hand do not differ more than 20 kg.
Number of symptoms reflects the number of health conditions the respondent has been bothered for at least the past six month. For assessing this, respondents were asked to look at a card with several health conditions of which they can choose such as pain in the back, knees, hips, or any other joints, sleeping problems or incontinence.
Feasibility
As a starting point of our analysis, information about the nature, the extent, and the selectivity of missing data and the (assumed) underlying mechanisms was gained by frequency count, and analysis of missing patterns [28]. If it could be ruled out that data are missing not at random (e.g. individuals with a history of anxiety disorder are more likely to omit the EQ-5D item anxiety/depression), feasibility was defined as the unweighted percentage of “not answered” items. For this purpose, we tested the relationship between the frequency distributions of two variables (e.g. using cross-tabulations and χ2-tests) and checked missing correlation patterns (results not shown). Instead of a completion rate, we are reporting the percentage of not answered items for each and all EQ-5D dimensions, for the EQ VAS, and for both EQ-5D dimensions and EQ VAS by gender, age groups, and country. The data were not weighted as it more directly reflects the feasibility of the survey.
Validity
Construct validity was examined by correlating EQ-5D dimensions and EQ VAS with other measures supposed to measure a similar (convergent validity, e.g. measures of mental health like EURO-D scale and EQ-5D anxiety/depression; CASP and EQ VAS) [21], or a dissimilar construct (divergent validity) [29,30,31]. Depending on the measurement level of the variables, the extent of relatedness of EQ-5D and other measures is expressed as a Pearson correlation coefficient (interval scale) or Spearman’s rho (at least one variable ordinal; both range: − 1 to + 1) and interpreted as follows: poor (r < 0.3), moderate (r ≥ 0.3 to r < 0.5), large (r ≥ 0.5) [32]; only correlations significant at the 1% level after Bonferroni adjustment are reported.
Known-groups validity was determined by comparing groups known to differ based on e.g. sociodemographic and health variables like age, gender, marital status, number of symptoms, and self-reported health. Results are expressed as anchor-based distribution of responses (e.g. percentage of “no problems” by category of each variable, mean EQ VAS values by category of each variable). For comparison reasons, we are reporting Cohen’s f as an effect size measure. F estimates the proportion of variance explained by the categorical (grouping) variable [30, 33]. According to Cohen, f is interpreted as follows when used in an analysis of variance (ANOVA):
-
f = 0.10 small effect
-
f = 0.25 medium effect
-
f = 0.40 large effect
For two group comparisons (t test, e.g. comparison of mean EQ VAS values between men and women), we derived Cohen’s f from Cohen’s d by dividing d by 2; for more than two group comparisons (e.g. comparison of mean EQ VAS values across age groups), we converted eta squared (from ANOVA) into Cohen’s f. To examine the association between two categorical variables (e.g. EQ-5D dimension mobility and BMI), we calculated Cramér's ν. V takes values between 0 and 1, with higher values indicating a stronger relationship. In the literature, ν is interpreted differently; we interpret ν = 0.1 as a small, ν = 0.3 as a medium, and ν = 0.5 as a large effect.
The complete validity analysis is based on weighted data. Whenever useful, results are reported stratified by country, age group, and/or gender.
Results
Data from more than fifty thousand panellists were included, with a mean age of 66 years, of which 56% were female. The division into age groups in five-year steps resulted in a nearly equivalent distribution with 10–19% of panellists in each age group. One in four panellists was in school for at least 12 years; two thirds were married or living in a registered partnership, and 58% were retired. Although only less than one fourth of the panellists reported no symptoms, 88% stated to have no limitations with ADL, and more than 40% reported fair or poor self-rated health; mean EQ VAS was 71. Panellist’s characteristics are presented in Table 1.
Feasibility
In the total SHARE sample, only 3% of all respondents have at least one missing value on EQ-5D; responses on the descriptive system are completely missing in 1.32%, on the whole instrument in 0.4%, and on the EQ VAS in 2.8% of the panellists. After stratifying by age and gender, missing values are generally below 3% across all dimensions and strata but most frequently missing in the dimensions self-care (women: 2.4%, men: 2.1%) and anxiety/depression (women: 2.2%, men: 2.1%) (Fig. 1). Overall, there is an increasing trend with higher age where missing responses peak in panellists aged 75–79 years. However, missing values are similarly distributed between men and women (Fig. 1); for EQ VAS, they range from 2.4 to 6.2%, and 2.4 to 5.5% in women and men, respectively, thus, generally higher than the rate of missing values on the descriptive system across age groups. Similarly, we observe a stronger increase in item non-response with advancing age resulting in the highest proportion of unanswered EQ VASs in the oldest-old panellists (Fig. 1).
Considering the total sample, except the Czech Republic, the proportion of not answered EQ-5D items is well below 2% across the five dimensions (Fig. 2a). However, there is great variability at the country level. Responses to the five dimensions are most frequently missing in the Czech Republic ranging between 7.2 and 8.5%, whereas Spain, Italy, and Sweden have consistently less than 1% missing values in all dimensions (Fig. 2a). Again, regarding the EQ VAS, missing values are slightly more prevalent (3.8%). Denmark is notable here with 10.8% missing EQ VAS responses, while items on the descriptive system are missing for less than 1.8% of the panellists indicating some discrepancy between the two components of the EQ-5D (Fig. 2b). This contrasts with the Czech Republic, with the highest number of missing values on the descriptive system but no missing answers on the EQ VAS. A similar but less pronounced gap can also be observed for Germany (EQ VAS: 8.4%, dimensions: 0.9–1.4%) and Sweden (EQ VAS: 5.0%, dimensions: 0.06–0.3%). On the contrary, Slovenia and Italy have consistently few missing responses (for both EQ-5D dimensions and EQ VAS).
Construct validity
Except for anxiety/depression, all EQ-5D dimensions moderately to strongly correlate with most physical measures, while anxiety/depression strongly correlates with the EURO-D (r = 0.527), demonstrating satisfactory convergent and divergent validity (Table 2). Moreover, all EQ-5D dimensions at least moderately and the EQ VAS strongly correlate with generic measures of self-perceived health status, namely CASP (r = 0.323–0.523), self-perceived health (r = 0.316–0.622), and number of symptoms (r = 0.352–0.520). For divergent measures (BMI, maximum grip strength, and all cognitive measures), correlations with EQ-5D dimensions or EQ VAS are small ranging between 0 (e.g. self-care and nervous) and 0.308 (e.g. EQ VAS and immediate 10-words-recall test); for convergent measures, correlations are predominantly found in the range between 0.35 and 0.65, with the highest correlation of r = 0.658 between usual activities and number of limitations in ADL, and r = 0.622 between self-perceived health and EQ VAS (Table 2). For physical and cognitive measures, the correlations with the EQ VAS are increasing with age, while the correlation with self-perceived health is stable across age groups (r50-54yrs = − 0.578 to r70-74yrs = − 0.632, Fig. 3). This is particularly pronounced for ADL (r80+yrs = − 0.480, r50-54yrs = − 0.233) and IADL (r80+yrs = − 0.483, r50-54yrs = − 0.242) with twice as high correlation coefficients in the group of the oldest compared to the youngest panellists.
Known-groups validity
The EQ-5D dimensions and the EQ VAS significantly and similarly differentiate among different stages of cognitive (e.g. memory, orientation), physical (e.g. ADL, IADL, GALI) and general health (e.g. number of symptoms, BMI, depression), demonstrating a satisfactory discriminative ability of the EQ-5D (Table 3). Regarding sociodemographic variables, we found that women rate their state of health worse than men, people living with a partner report fewer problems in all dimensions, and a social gradient regarding education and the current job situation. Again, the proportion of panellists reporting “no problems” and the mean EQ VAS are continuously decreasing with age; with the biggest declines for the oldest age group of 80+. EQ VAS scores show the largest relationship with general health measures like number of symptoms and self-perceived health with a Cohen’s f of > 0.8, followed by limitations with activities of daily living (ADL, IADL, GALI: f > 0.5) and a case of depression (which means a scale score of 4 or higher at the EURO-D scale, f > 0.4, Table 3). However, the association with sociodemographic variables like gender, education, and partner in household is small (f < 0.25), while correlations with cognitive measures, age, and current job situation are moderate (f > 0.3). The discriminatory ability of the individual EQ-5D dimensions is reflected in higher effect sizes in the respective corresponding groups, e.g. anxiety/depression separates best between individuals with and without depression (f = 0.51), self-care between individuals without ADL limitations and 1 + ADL limitations (f = 0.69), and pain/discomfort between individuals without IADL limitations and 1 + IADL limitations (f = 0.61).
Discussion
To the best of our knowledge, this is the first study to assess the feasibility, construct validity, and known-groups validity of the EQ-5D in a large-scale sample of older Europeans across 15 countries. Our findings indicate excellent feasibility of the EQ-5D descriptive system and the EQ VAS. Additionally, both showed a satisfactory construct validity and distinguished well between known-group differences. It is a major strength of this study that the overall analysis was conducted on the full-severity range using non-dichotomized EQ-5D data, which is common practice in the analysis of general population data.
Overall, just 3% of all respondents had one or more missing values, which implicates very good feasibility. The magnitude of missing values is in line with those reported for the general adult population [34,35,36,37,38], also for the elderly population [39, 40]. After stratifying the sample by age and gender, missing values were mostly below 3% across all five dimensions and, hence, were lower than previously published rates for the elderly population [41,42,43,44]. Further, we observed slightly increasing rates of missing responses to the descriptive system with higher age, which was already described elsewhere. However, this gradient was not as pronounced as in other studies [10, 40, 47, 48]. At the dimension level, self-care and anxiety/depression had somewhat higher proportions of missing values. While there is evidence that anxiety/depression may cause response issues and some embarrassment in the elderly population [49, 50], a higher prevalence of missing values in self-care was instead seen in patient samples with dementia [45, 46]. Furthermore, gender did not seem to be associated with completeness of the descriptive system, since men and women had similar frequencies of missing values. Nonetheless, evidence is inconclusive on the role of gender regarding this, as other studies suggest [35, 47]. With respect to the EQ VAS, we found slightly more absolute missing values in comparison to the descriptive system and a steeper increase in proportions of missing values with higher age. This is not surprising given that comprehension problems with the EQ VAS are well documented in older adults [42, 49, 51]. Nevertheless, the number of missing values described in this study is at the lower end of the range that is commonly reported in older samples [10].
To the best of our knowledge, this is the first study to conduct a multi-country comparison of missing values on the EQ-5D. With respect to missing values on the five dimensions and the EQ VAS, we found some degree of variability at the country level. Variability is even more pronounced with respect to missing EQ VAS values. However, we do not have an explanation for this. Although interviewer effects appear to be an obvious explanation [52], they are implausible in view of the highly standardized procedure in SHARE and due to the paper–pencil format. (The panellist answers the questionnaire following the interview). Since cultural measurement equivalence for different language versions of the EQ-5D has been proven several times [53,54,55,56], this explanation seems unlikely as well. Nevertheless, all country-level results are within an acceptable range, which are within the limits of what was to be expected from published literature [10].
Construct validity, i.e. convergent validity and divergent validity, of the five EQ-5D dimensions and EQ VAS was sustained by confirmation of the anticipated relationships with measures of physical, mental, cognitive, and general health as well as measures of self-perceived health status. Reassuringly, we found strong convergent validity with core aspects of the EQ-5D such as physical health and generic aspects of self-perceived health status (e.g. CASP, number of symptoms, and self-perceived health), which indicates that the EQ-5D measures overlapping constructs with the aforementioned scales. This is broadly in line with earlier studies [38, 57,58,59]. On the other hand, there is clear evidence of divergence between the EQ-5D components and constructs measuring aspects of mental and cognitive health as well as BMI or grip strength. Due to the rather low correlation, it appears that these constructs are not well reflected by the EQ-5D. These findings support the need for additional measures supplementing the self-reported health assessment in older respondents, as cognition is known to deteriorate with higher age [60] or where hand grip strength is assessed to detect older adults at risk of physical decline [61]. The prevalence of cognitive impairment (CI) increases with age [62, 63], and CI affects the live of affected individuals, negatively impacting the autonomy [64]. Since the findings of this study show an increasing impact of physical and cognitive aspects on self-perceived health status with age, and cognitive aspects are not captured by the EQ-5D, studies investigating older persons should consider the additional use of the cognition bolt-on [65,66,67]. However, in general, a general measure such as the EQ-5D appears to better capture the overall impact on health of the ageing population than a measure that addresses only a single aspect or dimension of health.
Both the EQ-5D descriptive system and the EQ VAS differentiate well between groups with known health differences, thus, demonstrating evidence to support known-groups validity for the EQ-5D. For sociodemographic variables, our findings are well aligned with results from the published literature and point in the expected direction with those being female, with lower education, living alone, and being unemployed had lower health in terms of EQ VAS and more problems reported on the descriptive system [40, 57, 59, 68,69,70,71,72]. Similarly, we found evidence to support differences between age groups, where health decreases monotonically with increasing age with a greater dip for the oldest age group, which supports findings from earlier studies [57, 68, 71, 72]. Interestingly, four studies reported some inconsistency with regard to age and the proportion of reported problems in anxiety/depression, which seemed to plateau or even decrease in older age groups [40, 59, 71, 72]. However, this pattern was not replicated in our findings indicating known-groups validity of the anxiety/depression dimension regarding different age groups. Moreover, known-group differences based on physical and general health measures were also confirmed for the descriptive system and the EQ VAS by our findings and are in line with the hypotheses suggested by the literature [45, 71]. Additionally, this study adds new validity evidence to the literature, since both components, the descriptive system and the EQ VAS, satisfactorily differentiated among groups with different stages of cognitive health in this sample of older Europeans, whereas the literature is otherwise scarce. There is mixed published evidence with respect to the EQ VAS’s ability to detect differences among severity levels in patients with dementia [46, 73]. For the descriptive system, potential ceiling effects were described for dementia patients [74,75,76], where the lack of sensitivity essentially prevents satisfactory discrimination of subgroups; the only exception was anxiety/depression, which showed positive association with a severity classification measure of dementia [46]. Despite these contradictory findings from the literature, the EQ-5D was able to demonstrate different levels of problems for groups with varying cognitive health.
This study’s major strength is the large sample size available for the analysis of EQ-5D data, which provides two distinct advantages. First, the sample includes a wide spread of reported health problems and, hence, offers the potential to analyse measurement properties across the whole severity range. Second, the sample size is sufficient to detect known-group differences, whereas other studies often lack the power to detect statistically significant differences. However, some limitations need to be considered. The underlying data may be prone to a selection bias favouring respondents who are potentially healthier and more independent as those respondents are more likely to respond to a survey. Thus, severely ill, institutionalized, and non-independent respondents may be underrepresented in this sample. Consequently, the demonstrated measurement properties may not be fully representative for those respondents. Similarly, we have no information as to whether respondents required or received help when answering the EQ-5D. Hence, we cannot control for this factor, and it should be considered that feasibility aspects may be overestimated.
Moreover, there is a newer version of the EQ-5D with an extended response scale (five levels) and improved measurement properties (e.g. in terms of sensitivity and distribution properties) available [77], and based on the demonstrated superior ability of the generic EQ-5D to capture the overall impact on health of the ageing population, a shift to the refined five-level EQ-5D may be considered to provide an even more accurate assessment of health in the elderly SHARE population.
Conclusion
In conclusion, the EQ-5D descriptive system and the EQ VAS demonstrated very good feasibility properties, construct validity, and known-groups validity in a sample of pan-European older adults. Our results provide further evidence to strengthen the use of self-administered EQ-5D in older populations achieving a high degree of instrument completion. A generic measure such as the EQ-5D seems to better capture the overall impact on health of the ageing population when compared to measures dedicated to a single aspect or dimension of health only.
References
World Health Organization (2021). Ageing and health
Devlin, N., Parkin, D., & Janssen, B. (2020). An introduction to EQ-5D instruments and their applications. In N. Devlin, D. Parkin, & B. Janssen (Eds.), Methods for analysing and reporting EQ-5D data (pp. 1–22). Springer.
Martí-Pastor, M., Pont, A., Ávila, M., Garin, O., Vilagut, G., Forero, C. G., et al. (2018). Head-to-head comparison between the EQ-5D-5L and the EQ-5D-3L in general population health surveys. Population Health Metrics, 16(1), 14. https://doi.org/10.1186/s12963-018-0170-8
Rencz, F., Gulácsi, L., Drummond, M., Golicki, D., Prevolnik Rupel, V., Simon, J., et al. (2016). EQ-5D in Central and Eastern Europe: 2000–2015. Quality of Life Research, 25(11), 2693–2710. https://doi.org/10.1007/s11136-016-1375-6
Szende, A., Janssen, B., & Cabases, J. (Eds.). (2014). Self-reported population health: An international perspective based on EQ-5D. Springer.
König, H.-H., Heider, D., Lehnert, T., Riedel-Heller, S. G., Angermeyer, M. C., Matschinger, H., et al. (2010). Health status of the advanced elderly in six European countries: Results from a representative survey using EQ-5D and SF-12. Health and Quality of Life Outcomes, 8, 143. https://doi.org/10.1186/1477-7525-8-143
Feng, Y.-S., Kohlmann, T., Janssen, M. F., & Buchholz, I. (2021). Psychometric properties of the EQ-5D-5L: A systematic review of the literature. Quality of Life Research, 30(3), 647–673. https://doi.org/10.1007/s11136-020-02688-y
Buchholz, I., Janssen, M. F., Kohlmann, T., & Feng, Y.-S. (2018). A systematic review of studies comparing the measurement properties of the three-level and five-level versions of the EQ-5D. PharmacoEconomics, 36(6), 645–661. https://doi.org/10.1007/s40273-018-0642-5
König, H.-H., Brettschneider, C., Lühmann, D., Kaduszkiewicz, H., Oey, A., Wiese, B., et al. (2020). EQ-5D-3L health status and health state utilities of the oldest-old (85 +) in Germany: Results from the AgeCoDe-AgeQualiDe study. Quality of Life Research, 29(12), 3223–3232. https://doi.org/10.1007/s11136-020-02597-0
Marten, O., Brand, L., & Greiner, W. (2021). Feasibility of the EQ-5D in the elderly population: A systematic review of the literature. Quality of Life Research. https://doi.org/10.1007/s11136-021-03007-9
Maresova, P., Javanmardi, E., Barakovic, S., Barakovic Husic, J., Tomsone, S., Krejcar, O., et al. (2019). Consequences of chronic diseases and other limitations associated with old age—a scoping review. BMC Public Health, 19(1), 1431. https://doi.org/10.1186/s12889-019-7762-5
Kehler, D. S. (2019). Age-related disease burden as a measure of population ageing. The Lancet Public Health, 4(3), e123–e124. https://doi.org/10.1016/S2468-2667(19)30026-X
Divo, M. J., Martinez, C. H., & Mannino, D. M. (2014). Ageing and the epidemiology of multimorbidity. The European Respiratory Journal, 44(4), 1055–1068. https://doi.org/10.1183/09031936.00059814
Sleeman, K. E., de Brito, M., Etkind, S., Nkhoma, K., Guo, P., Higginson, I. J., et al. (2019). The escalating global burden of serious health-related suffering: Projections to 2060 by world regions, age groups, and health conditions. The Lancet Global Health, 7(7), e883–e892. https://doi.org/10.1016/S2214-109X(19)30172-X
Börsch-Supan, A. (2020). Survey of health, ageing and retirement in Europe (SHARE) wave 4: SHARE–ERIC. Springer.
Börsch-Supan, A., Brandt, M., Litwin, H., & Weber, G. (2013). Active ageing and solidarity between generations in Europe: First results from SHARE after the economic crisis. De Gruyter.
Malter, F., & Börsch-Supan, A. (2013). SHARE wave 4: Innovations & methodology. Max Planck Institute for Social Law and Social Policy.
Börsch-Supan, A., Brandt, M., Hunkler, C., Kneip, T., Korbmacher, J., Malter, F., et al. (2013). Data resource profile: The survey of health, ageing and retirement in Europe (SHARE). International journal of epidemiology, 42(4), 992–1001. https://doi.org/10.1093/ije/dyt088
Börsch-Supan, A. (2011). SHARE wave 4.
Börsch-Supan, A. (2011). SHARE wave 4—Dropoff questionnaire variables
Mehrbrodt, T., Gruber, S., & Wagner, M. (2019). Scales and multi-item indicators in the survey of health, ageing and retirement in Europe. SHARE Working Paper Series.
Nicholas, S., Huppert, F. A., McWilliams, B., & Melzer, D. (2003). Physical and cognitive function. In M. Marmot, J. Banks, R. Blundell, & C. Lessof (Eds.), Health, wealth, and lifestyles of the older population in England: The 2002 English Longitudinal Study of Aging (pp. 249–271). Hrsg.
van Oyen, H., van der Heyden, J., Perenboom, R., & Jagger, C. (2006). Monitoring population disability: Evaluation of a new global activity limitation indicator (GALI). Sozial- und Praventivmedizin, 51(3), 153–161. https://doi.org/10.1007/s00038-006-0035-y
Prince, M. J., Reischies, F., Beekman, A. T., Fuhrer, R., Jonker, C., Kivela, S. L., et al. (1999). Development of the EURO-D scale-a European, union initiative to compare symptoms of depression in 14 European centres. The British Journal of Psychiatry: The Journal of Mental Science, 174, 330–338. https://doi.org/10.1192/bjp.174.4.330
Harris, S. J., & Dowson, J. H. (1982). Recall of a 10-word list in the assessment of dementia in the elderly. The British Journal of Psychiatry: The Journal of Mental Science, 141, 524–527. https://doi.org/10.1192/bjp.141.5.524
Hyde, M., Wiggins, R. D., Higgs, P., & Blane, D. B. (2003). A measure of quality of life in early old age: The theory, development and properties of a needs satisfaction model (CASP-19). Aging & Mental Health, 7(3), 186–194. https://doi.org/10.1080/1360786031000101157
von dem Knesebeck, O., Hyde, M., Higgs, P., Kupfer, A., & Siegrist, J. (2005). Quality of Life and Well-Being. In A. Börsch-Supan (Ed.), Health, ageing and retirement in Europe: First results from the survey of health ageing and retirement in Europe (pp. 199–203). Mannheim Research Institute for the Economics of Aging (MEA).
StataCorp. (2013). Stata 13 base reference manual/rmisstable. London: StataCorp.
Chin, C.-L., & Yao, G. (2014). Convergent validity. In A. C. Michalos (Ed.), Encyclopedia of quality of life and well-being research (pp. 1275–1276). Springer.
Cohen, R. J., & Swerdlik, M. E. (2005). Psychological testing and assessment: An introduction to tests and measurement (6th ed.). McGraw-Hill.
Reeve, B. B., Wyrwich, K. W., Wu, A. W., Velikova, G., Terwee, C. B., Snyder, C. F., et al. (2013). ISOQOL recommends minimum standards for patient-reported outcome measures used in patient-centered outcomes and comparative effectiveness research. Quality of Life Research, 22(8), 1889–1905. https://doi.org/10.1007/s11136-012-0344-y
Hinkle, D. E., Wiersma, W., & Jurs, S. G. (2003). Applied statistics for the behavioral sciences (5th ed.). Houghton Mifflin.
Lenhard, W., & Lenhard, A. (2016). Calculation of effect size. Berechnung von Effektstärken. https://doi.org/10.13140/RG.2.2.17823.92329
Perneger, T. V., Combescure, C., & Courvoisier, D. S. (2010). General population reference values for the French version of the EuroQol EQ-5D health utility instrument. Value in Health, 13(5), 631–635. https://doi.org/10.1111/j.1524-4733.2010.00727.x
Barton, G. R., Sach, T. H., Avery, A. J., Jenkinson, C., Doherty, M., Whynes, D. K., et al. (2008). A comparison of the performance of the EQ-5D and SF-6D for individuals aged or= 45 years. Health Economics, 17(7), 815–832. https://doi.org/10.1002/hec.1298
Agborsangaya, C. B., Lahtinen, M., Cooke, T., & Johnson, J. A. (2014). Comparing the EQ-5D 3L and 5L: Measurement properties and association with chronic conditions and multimorbidity in the general population. Health and Quality of Life Outcomes, 12, 74. https://doi.org/10.1186/1477-7525-12-74
Młyńczak, K., & Golicki, D. (2021). Validity of the EQ-5D-5L questionnaire among the general population of Poland. Quality of Life Research, 30(3), 817–829. https://doi.org/10.1007/s11136-020-02667-3
Badia, X., Schiaffino, A., Alonso, J., & Herdman, M. (1998). Using the EuroQoI-5D in the Catalan general population: Feasibility and construct validity. Quality of Life Research, 7(4), 311–322. https://doi.org/10.1023/a:1024933913698
Christiansen, A. S. J., Møller, M. L. S., Kronborg, C., Haugan, K. J., Køber, L., Højberg, S., et al. (2021). Comparison of the three-level and the five-level versions of the EQ-5D. The European Journal of Health Economics, 22(4), 621–628. https://doi.org/10.1007/s10198-021-01279-z
Luthy, C., Cedraschi, C., Allaz, A.-F., Herrmann, F. R., & Ludwig, C. (2015). Health status and quality of life: Results from a national survey in a community-dwelling sample of elderly people. Quality of Life Research, 24(7), 1687–1696. https://doi.org/10.1007/s11136-014-0894-2
Coast, J., Peters, T. J., Richards, S. H., & Gunnell, D. J. (1998). Use of the EuroQol among elderly acute care patients. Quality of Life Research, 7(1), 1–10. https://doi.org/10.1023/a:1008857203434
Hulme, C., Long, A. F., Kneafsey, R., & Reid, G. (2004). Using the EQ-5D to assess health-related quality of life in older people. Age and Ageing, 33(5), 504–507. https://doi.org/10.1093/ageing/afh178
Brazier, J. E., Walters, S. J., Nicholl, J. P., & Kohler, B. (1996). Using the SF-36 and EuroQol on an elderly population. Quality of Life Research, 5(2), 195–204. https://doi.org/10.1007/BF00434741
Davis, J. C., Liu-Ambrose, T., Richardson, C. G., & Bryan, S. (2013). A comparison of the ICECAP-O with EQ-5D in a falls prevention clinical setting: Are they complements or substitutes? Quality of Life Research, 22(5), 969–977. https://doi.org/10.1007/s11136-012-0225-4
Michalowsky, B., Xie, F., Kohlmann, T., Gräske, J., Wübbeler, M., Thyrian, J. R., et al. (2020). Acceptability and validity of the EQ-5D in patients living with dementia. Value in Health, 23(6), 760–767. https://doi.org/10.1016/j.jval.2020.01.022
Ankri, J., Beaufils, B., Novella, J.-L., Morrone, I., Guillemin, F., Jolly, D., et al. (2003). Use of the EQ-5D among patients suffering from dementia. Journal of Clinical Epidemiology, 56(11), 1055–1063. https://doi.org/10.1016/S0895-4356(03)00175-6
Menn, P., Weber, N., & Holle, R. (2010). Health-related quality of life in patients with severe COPD hospitalized for exacerbations—comparing EQ-5D, SF-12 and SGRQ. Health and Quality of Life Outcomes, 8, 39. https://doi.org/10.1186/1477-7525-8-39
Gerard, K., Nicholson, T., Mullee, M., Mehta, R., & Roderick, P. (2004). EQ-5D versus SF-6D in an older, chronically Ill patient group. Applied Health Economics and Health Policy, 3(2), 91–102. https://doi.org/10.2165/00148365-200403020-00005
Holland, R., Smith, R. D., Harvey, I., Swift, L., & Lenaghan, E. (2004). Assessing quality of life in the elderly: A direct comparison of the EQ-5D and AQoL. Health Economics, 13(8), 793–805. https://doi.org/10.1002/hec.858
van Leeuwen, K. M., Jansen, A. P. D., Muntinga, M. E., Bosmans, J. E., Westerman, M. J., van Tulder, M. W., et al. (2015). Exploration of the content validity and feasibility of the EQ-5D-3L, ICECAP-O and ASCOT in older adults. BMC Health Services Research, 15, 201. https://doi.org/10.1186/s12913-015-0862-8
Hickson, M., & Frost, G. (2004). An investigation into the relationships between quality of life, nutritional status and physical function. Clinical Nutrition, 23(2), 213–221. https://doi.org/10.1016/S0261-5614(03)00127-4
Bowling, A. (2005). Mode of questionnaire administration can have serious effects on data quality. Journal of Public Health (Oxford, England), 27(3), 281–291. https://doi.org/10.1093/pubmed/fdi031
Luo, N., Chew, L.-H., Fong, K.-Y., Koh, D.-R., Ng, S.-C., Yoon, K.-H., et al. (2003). Do English and Chinese EQ-5D versions demonstrate measurement equivalence? An exploratory study. Health and Quality of Life Outcomes, 1, 7. https://doi.org/10.1186/1477-7525-1-7
Luo, N., Li, M., Chevalier, J., Lloyd, A., & Herdman, M. (2013). A comparison of the scaling properties of the English, Spanish, French, and Chinese EQ-5D descriptive systems. Quality of Life Research, 22(8), 2237–2243. https://doi.org/10.1007/s11136-012-0342-0
Luo, N., Wang, Y., How, C. H., Wong, K. Y., Shen, L., Tay, E. G., et al. (2015). Cross-cultural measurement equivalence of the EQ-5D-5L items for English-speaking Asians in Singapore. Quality of Life Research, 24(6), 1565–1574. https://doi.org/10.1007/s11136-014-0864-8
Wang, Y., Tan, N.-C., Tay, E.-G., Thumboo, J., & Luo, N. (2015). Cross-cultural measurement equivalence of the 5-level EQ-5D (EQ-5D-5L) in patients with type 2 diabetes mellitus in Singapore. Health and Quality of Life Outcomes, 13, 103. https://doi.org/10.1186/s12955-015-0297-2
Savoia, E., Fantini, M. P., Pandolfi, P. P., Dallolio, L., & Collina, N. (2006). Assessing the construct validity of the Italian version of the EQ-5D: Preliminary results from a cross-sectional study in North Italy. Health and Quality of Life Outcomes, 4, 47. https://doi.org/10.1186/1477-7525-4-47
van Leeuwen, K. M., Bosmans, J. E., Jansen, A. P. D., Hoogendijk, E. O., van Tulder, M. W., van der Horst, H. E., et al. (2015). Comparing measurement properties of the EQ-5D-3L, ICECAP-O, and ASCOT in frail older adults. Value in Health, 18(1), 35–43. https://doi.org/10.1016/j.jval.2014.09.006
Janssen, M. F., Pickard, A. S., Golicki, D., Gudex, C., Niewada, M., Scalone, L., et al. (2013). Measurement properties of the EQ-5D-5L compared to the EQ-5D-3L across eight patient groups: A multi-country study. Quality of Life Research, 22(7), 1717–1727. https://doi.org/10.1007/s11136-012-0322-4
Murman, D. L. (2015). The impact of age on cognition. Seminars in Hearing, 36(3), 111–121. https://doi.org/10.1055/s-0035-1555115
Halaweh, H. (2020). Correlation between health-related quality of life and hand grip strength among older adults. Experimental Aging Research, 46(2), 178–191. https://doi.org/10.1080/0361073X.2020.1716157
Pais, R., Ruano, L., Moreira, C., Carvalho, O. P., & Barros, H. (2020). Prevalence and incidence of cognitive impairment in an elder Portuguese population (65–85 years old). BMC Geriatrics, 20(1), 470. https://doi.org/10.1186/s12877-020-01863-7
Kiejna, A., Frydecka, D., Adamowski, T., Bickel, H., Reynish, E., Prince, M., et al. (2011). Epidemiological studies of cognitive impairment and dementia across Eastern and Middle European countries (epidemiology of dementia in Eastern and Middle European Countries). International Journal of Geriatric Psychiatry, 26(2), 111–117. https://doi.org/10.1002/gps.2511
Hussenoeder, F. S., Conrad, I., Roehr, S., Fuchs, A., Pentzek, M., Bickel, H., et al. (2020). Mild cognitive impairment and quality of life in the oldest old: A closer look. Quality of Life Research: An International Journal of Quality of Life Aspects of Treatment, Care and Rehabilitation, 29(6), 1675–1683. https://doi.org/10.1007/s11136-020-02425-5
Geraerds, A. J. L. M., Bonsel, G. J., Janssen, M. F., de Jongh, M. A., Spronk, I., Polinder, S., et al. (2019). The added value of the EQ-5D with a cognition dimension in injury patients with and without traumatic brain injury. Quality of Life Research, 28(7), 1931–1939. https://doi.org/10.1007/s11136-019-02144-6
Geraerds, A. J. L. M., Bonsel, G. J., Janssen, M. F., Finch, A. P., Polinder, S., & Haagsma, J. A. (2021). Methods used to identify, test, and assess impact on preferences of bolt-ons: A systematic review. Value in Health, 24(6), 901–916. https://doi.org/10.1016/j.jval.2020.12.011
Finch, A. P., Brazier, J., & Mukuria, C. (2021). Selecting bolt-on dimensions for the EQ-5D: Testing the impact of hearing, sleep, cognition, energy, and relationships on preferences using pairwise choices. Medical Decision Making, 41(1), 89–99. https://doi.org/10.1177/0272989X20969686
Kontodimopoulos, N., Pappa, E., Niakas, D., Yfantopoulos, J., Dimitrakaki, C., & Tountas, Y. (2008). Validity of the EuroQoL (EQ-5D) instrument in a Greek general population. Value in Health, 11(7), 1162–1169. https://doi.org/10.1111/j.1524-4733.2008.00356.x
Diaz-Redondo, A., Rodriguez-Blazquez, C., Ayala, A., Martinez-Martin, P., & Forjaz, M. J. (2014). EQ-5D rated by proxy in institutionalized older adults with dementia: Psychometric pros and cons. Geriatrics & Gerontology International, 14(2), 346–353. https://doi.org/10.1111/ggi.12108
N’Goran, A. A., Déruaz-Luyet, A., Haller, D. M., Zeller, A., Rosemann, T., Streit, S., et al. (2017). Comparing the self-perceived quality of life of multimorbid patients and the general population using the EQ-5D-3L. PLoS ONE, 12(12), e0188499. https://doi.org/10.1371/journal.pone.0188499
Burström, K., Johannesson, M., & Diderichsen, F. (2001). Swedish population health-related quality of life results using the EQ-5D. Quality of Life Research, 10(7), 621–635. https://doi.org/10.1023/a:1013171831202
Kind, P., Dolan, P., Gudex, C., & Williams, A. (1998). Variations in population health status: Results from a United Kingdom national questionnaire survey. BMJ, 316(7133), 736–741. https://doi.org/10.1136/bmj.316.7133.736
Schrag, A., Selai, C., Jahanshahi, M., & Quinn, N. P. (2000). The EQ-5D–a generic quality of life measure-is a useful instrument to measure quality of life in patients with Parkinson’s disease. Journal of Neurology, Neurosurgery, and Psychiatry, 69(1), 67–73. https://doi.org/10.1136/jnnp.69.1.67
Coucill, W., Bryan, S., Bentham, P., Buckley, A., & Laight, A. (2001). EQ-5D in patients with dementia: An investigation of inter-rater agreement. Medical Care, 39(8), 760–771. https://doi.org/10.1097/00005650-200108000-00003
Karlawish, J. H., Zbrozek, A., Kinosian, B., Gregory, A., Ferguson, A., & Glick, H. A. (2008). Preference-based quality of life in patients with Alzheimer’s disease. Alzheimer’s & Dementia: The Journal of the Alzheimer’s Association, 4(3), 193–202. https://doi.org/10.1016/j.jalz.2007.11.019
Naglie, G., Tomlinson, G., Tansey, C., Irvine, J., Ritvo, P., Black, S. E., et al. (2006). Utility-based quality of life measures in Alzheimer’s disease. Quality of Life Research, 15(4), 631–643. https://doi.org/10.1007/s11136-005-4364-8
EuroQol Research Foundation (2019). EQ-5D-5L user guide
Acknowledgements
This paper uses data from SHARE Wave 4 (https://doi.org/10.6103/SHARE.w4.710), see Börsch-Supan et al. (2013) for methodological details. The SHARE data collection has been funded by the European Commission through FP5 (QLK6-CT-2001-00360), FP6 (SHARE-I3: RII-CT-2006-062193, COMPARE: CIT5-CT-2005-028857, SHARELIFE: CIT4-CT-2006-028812), FP7 (SHARE-PREP: GA N°211909, SHARE-LEAP: GA N°227822, SHARE M4: GA N°261982, DASISH: GA N°283646) and Horizon 2020 (SHARE-DEV3: GA N°676536, SHARE-COHESION: GA N°870628, SERISS: GA N°654221, SSHOC: GA N°823782) and by DG Employment, Social Affairs & Inclusion. Additional funding from the German Ministry of Education and Research, the Max Planck Society for the Advancement of Science, the U.S. National Institute on Ageing (U01_AG09740-13S2, P01_AG005842, P01_AG08291, P30_AG12815, R21_AG025169, Y1-AG-4553-01, IAG_BSR06-11, OGHA_04-064, HHSN271201300071C) and from various national funding sources is gratefully acknowledged (see www.share-project.org).
Funding
We thank the EuroQol Research Foundation for approving and funding our project and SHARE for providing and granting permission to use the data from Wave 4.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there is no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Buchholz, I., Marten, O. & Janssen, M.F. Feasibility and validity of the EQ-5D-3L in the elderly Europeans: a secondary data analysis using SHARE(d) data. Qual Life Res 31, 3267–3282 (2022). https://doi.org/10.1007/s11136-022-03158-3
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11136-022-03158-3