Abstract
Background
Atopic dermatitis (AD) is a common chronic inflammatory skin disorder affecting up to 10% of adults. The EQ-5D is the most commonly used generic preference-accompanied measure to generate quality-adjusted life years (QALYs) for economic evaluations.
Objectives
We aimed to compare psychometric properties of the three-level and five-level EQ-5D (EQ-5D-3L and EQ-5D-5L) in adult patients with AD.
Methods
In a multicentre cross-sectional study, 218 AD patients with a broad range of severity completed the EQ-5D-3L, EQ-5D-5L, Dermatology Life Quality Index (DLQI) and Skindex-16. Disease severity outcomes included the Investigator Global Assessment, Eczema Area and Severity Index and the objective SCORing Atopic Dermatitis.
Results
A good agreement was established between the two EQ-5D versions with an intraclass correlation coefficient of 0.815 (95% CI 0.758–0.859, p < 0.001). Overall, 33 different health state profiles occurred in the EQ-5D-3L and 84 in the EQ-5D-5L. Compared to the EQ-5D-3L, ceiling effect was reduced for the mobility, self-care, usual activities and pain/discomfort dimensions by 4.6–11.5%. EQ-5D-5L showed higher average relative informativity (Shannon’s evenness index: 0.64 vs. 0.59). EQ-5D-5L demonstrated better convergent validity with EQ VAS, DLQI and Skindex-16. The two measures were similar in distinguishing between groups of patients based on disease severity and skin-specific quality of life with a moderate or large effect size (η2 = 0.083–0.489).
Conclusion
Both instruments exhibited good psychometric properties in AD; however, the EQ-5D-5L was superior in terms of ceiling effects, informativity and convergent validity. We recommend the use of the EQ-5D-5L to measure health outcomes in clinical settings and for QALY calculations in AD.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Atopic dermatitis (AD) is a common chronic inflammatory skin disorder affecting up to 10% of adults [1, 2]. It can appear on any area of the body, but predilection sites are the face, hands, and flexural surfaces of the extremities [1]. Clinical symptoms include recurrent eczematous lesions and intense itch that may considerably decrease patients’ health-related quality of life (HRQoL). The excessive dryness, itching and scratching may cause substantial limitations in daily functioning, social interactions, leisure activities, and may lead to sleep disturbance [3,4,5,6]. Treatments include topical emollients, topical corticosteroids, calcineurin inhibitors and systemic immunosuppressants (e.g. corticosteroids, cyclosporine A, azathioprine, mycophenolate mofetil or methotrexate), according to disease severity [7, 8]. Recently, an increasing number of new treatment options have become available for moderate-to-severe AD, such as targeted biological therapies (dupilumab and tralokinumab) and small molecules (baricitinib, abrocitinib and upadacitinib) [9].
AD represents a large burden on patients and society with an average annual total cost per patient of up to €20,000 [10,11,12]. New treatments typically require more health resources, and thus, providing evidence on their cost-effectiveness is important to show their value for money and to support financial decision-making in healthcare [13]. In these economic evaluations, quality-adjusted life year (QALY) is used as a summary outcome that combines quantity and quality of life. Using generic preference-accompanied instruments is the most common way to assess HRQoL to generate QALYs. These measures consist of a descriptive system and a set of utility values [14]. The most commonly used generic preference-accompanied measure is the EQ-5D [15]. Over the past three decades, it has been used in over 10,000 studies and by now, it has become a preferred instrument to estimate QALYs in pharmacoeconomic guidelines in nearly 30 countries [16,17,18].
The EQ-5D has two versions for adults, the three-level (EQ-5D-3L, hereafter 3L) [19], and the five-level (EQ-5D-5L, hereafter 5L) [20]. Both have been increasingly used in dermatological patient populations [21,22,23,24]. The major difference between the two adult EQ-5D questionnaires is that the 5L includes not three, but five levels in each dimension and uses a standardised wording across dimensions. In many countries, including Hungary, both adult questionnaires are recommended by pharmacoeconomic guidelines [18]; however, these may lead to different cost-effectiveness outcomes, therefore understanding their psychometric properties in different contexts and settings is critical to inform the debate about the choice of instrument.
Several previous studies in different health condition groups and general population samples showed improved measurement properties of the 5L descriptive system, such as reduced ceiling effect, better informativity and construct validity [25, 26]. Among dermatological conditions, the measurement properties of the 3L and 5L have been compared in psoriasis [27] and hidradenitis suppurativa (HS) [28]; however, no comparative study is available in AD. There could be large differences in how the descriptive systems perform across different health conditions, even among chronic skin diseases. Furthermore, it is important to examine how measurement properties of the descriptive systems translate into the discriminatory power of utilities, as this has a direct impact on QALYs.
This study therefore seeks to compare the psychometric properties of the 3L and 5L with regard to both the descriptive systems and utilities (hereafter referred to as index scores in the context of the EQ-5D) in adult patients with AD. The Hungarian 3L and 5L value sets will be used to estimate index scores that were developed in a parallel valuation study using the same respondents (n = 1000), protocol (i.e. EQ-VT), valuation method (i.e. composite time trade-off) and modelling approach (i.e. heteroscedastic Tobit) [29]. This will give us a unique opportunity to compare not only the descriptive systems but also utilities using real-world patient data. We aim to focus on the following psychometric properties: ceiling and floor effect, agreement, redistribution properties, informativity, convergent and known-groups validity.
Methods
Study design and patients
Between March 2018 and January 2021, a cross-sectional, multicentre study was conducted in Hungary among consecutive adult AD patients. Data were collected at two university dermatology clinics in Budapest and Debrecen and an outpatient centre in Pannonhalma. In each study site, patients were asked to read and sign an informed consent form on paper before participating in the study. Ethical approval was granted by the Scientific and Ethical Committee of the Medical Research Council in Hungary (reference No.: 29655/2018/EKU). Eligible patients were aged 18 years or over and had a diagnosis of AD confirmed by a dermatologist. Patients completed multiple generic and skin-specific HRQoL measures in a fixed order: Dermatology Life Quality Index (DLQI) [30], EQ-5D-5L (5L) [20], Skindex-16 [31], and EQ-5D-3L (3L) [19]. The 5L was placed before the 3L within the questionnaire to prevent the underuse of the second and fourth levels in the 5L [32]. The EuroQol visual analogue scale (EQ VAS) was completed only once, as part of the 5L.
Measures
A detailed description of the HRQoL measures used in the study is provided in Table 1, including their items, response levels, scoring and interpretation. In addition to HRQoL instruments, patients were asked to assess their level of itching and sleep disturbance for the past 1 month and their current disease severity (PtGA) using 11-point visual analogue scales (VAS). Demographic and medical history data were obtained from patients, including age, sex, education, employment, family history of AD and disease duration. Dermatologists assessed patients’ disease severity using the Investigator Global Assessment (IGA) [33], the objective SCORing Atopic Dermatitis (oSCORAD) [34], and the Eczema Area and Severity Index (EASI) scales [35], and provided information about treatments. These severity scales are widely used in clinical trials, treatment guidelines and core outcome sets [36,37,38]. We used the cut-off values for the interpretation of EASI and oSCORAD scores as published in Chopra et al. [39], and for DLQI as suggested by Hongbo et al. [40, 41].
Statistical analyses
We built on methods established in previous psychometric studies comparing the performance of the 3L and 5L across different healthy and patient populations [25, 27, 28, 32, 42].
Feasibility and ceiling
The feasibility was assessed by comparing the number of missing responses for the two EQ-5D questionnaires. Missing values were not imputed. Due to the two additional response levels, a reduced ceiling was expected in the 5L compared to the 3L. First, we computed the difference in the proportion of respondents scoring no problems (absolute ceiling reduction). Then, we calculated the relative reduction as (ceiling3L-ceiling5L)/ceiling3L. We compared the difference in ceiling between the 3L and 5L using McNemar’s test. The distributions of index scores were visualised using histograms, and the proportion of patients reporting no problems across all five EQ-5D dimensions was calculated to estimate the ceiling.
Agreement
The difference between 3L and 5L index scores was tested by Wilcoxon signed-rank test. The agreement between the 3L and 5L was displayed using a Bland–Altman plot [43], with the mean of the 3L and 5L index scores on the axis x and their difference on axis y. The 95% confidence interval for the difference was calculated as the mean difference ± 1.96 × standard deviation (SD). The points outside the upper and lower limit were considered outliers. We used the intraclass correlation coefficient (ICC) to test parallel forms reliability, which reflects both the agreement and degree of correlation between the two descriptive systems [44]. A two-way random model with absolute agreement was used to estimate ICCs [45]. We classified ICC values as follows: poor: 0–0.39, fair: 0.40–0.59, good: 0.60–0.74 and excellent: 0.75–1 [46]. Good or excellent agreement was expected between 3L and 5L [25].
Redistribution properties
We calculated the proportion of consistent and inconsistent 3L -5L response pairs using cross-tabulations. A 5L response at least two levels away from its 3L pair was considered inconsistent (e.g. respondent chooses severe problems [level 4] on the 5L and some problems [level 2] on the 3L) [32]. To calculate the average size of inconsistency, 3L responses were recoded to a 5L scale (level 13L = level 15L, level 23L = level 35L and level 33L = level 55L) and the following formula was used: |3L-5L| – 1 [32].
Informativity
Informativity reflects the ability of an instrument to discriminate between different levels of health [47]. The informativity of the five dimensions of 3L and 5L was determined using Shannon’s (H′) and Shannon’s evenness (J′) indices [47, 48]. The H′ expresses the absolute information content (the number of possible responses) combined with how evenly the information is distributed across all responses, while J′ represents the evenness of distribution exclusively. Our hypothesis was that the 5L with its two additional levels improves the informativity of the 3L [49]. We calculated the two indices according to the following formulae (L: number of levels in one dimension of the EQ-5D; pi: percentage of patients choosing the ith level):
Higher H′ indicates better informativity (range: 0 to log2L, where log2L is 1.85 for the 3L and 2.32 for the 5L). The value of J′ ranges from 0 to 1, whereby 0 corresponds to the worst discriminatory power, when all responses are in the same response level and 1 indicates the best discriminatory power with even distribution of responses among all levels [25].
Convergent validity
Convergent validity was analysed by calculating Spearman’s rank order correlation coefficients (rs) between the 3L and 5L dimensions and index scores and previously validated other measures. Based on earlier studies, we hypothesized at least moderate correlations between the EQ-5D dimensions and index scores and EQ VAS, DLQI, and Skindex-16 [50], and weak correlations with severity measures, including IGA, oSCORAD, EASI, and PtGA VAS [51]. In general, we expected most EQ-5D dimensions and index scores to correlate weakly or very weakly with sleep disturbance and itching VAS, as these are not parts of the EQ-5D descriptive system [52]. The only exception was itching for which we assumed a moderate correlation with the pain/discomfort dimension [53]. We expected the 5L to be more strongly related to these disease severity and skin-specific HRQoL measures. We interpreted correlation coefficients as follows: very weak < 0.20, weak 0.20–0.39, moderate 0.40–0.59, strong 0.60–0.79 and very strong 0.80 < [54].
Known-group validity
Due to the skewed distribution of EQ-5D index scores, non-parametric Mann–Whitney and Kruskal–Wallis tests were used to assess and compare the ability of 3L and 5L to distinguish between known groups of patients defined by severity scores on IGA, oSCORAD, and EASI or skin-specific HRQoL on DLQI. We hypothesized that patients with higher disease severity or worse skin-specific HRQoL have significantly lower index scores and the 5L is able to better differentiate across known groups.
Effect sizes (ES) were calculated as follows:
where k is the number of groups, and n is the sample size. We interpreted ESs ≥ 0.01 as small, ≥ 0.06 as moderate and ≥ 0.14 as large [55]. Relative efficiency (RE) was computed as the ratio of the ESs of 5L and 3L index scores. A RE larger than 1 indicated that the 5L was more efficient in distinguishing between known groups. Data analysis was carried out in R Statistical Software (v4.1.2 Vienna, Austria) [56]. A p < 0.05 was considered statistically significant and all tests were two-sided.
Results
Overall, 224 adult AD patients were invited to the study, four of whom declined to participate and another two patients did not finish the questionnaire. Thus, a total of 218 patients completed the questionnaire. No respondents were excluded from the data analysis. Demographic and clinical characteristics of patients are summarized in Table 2. Overall, 57.8% were women and the mean age was 31.3 ± 11.7 years (range 18–73). According to oSCORAD, 21.1%, 33.5% and 45.4% had clear/mild, moderate and severe AD, respectively. Nearly two-thirds of the patients (63.3%) were treated by systemic non-biological therapy at the time of the survey, while 23.4% received topical therapy only and a minority (9.6%) were untreated. Patients reported substantial impairment in their skin-specific HRQoL with mean DLQI score of 13.4 ± 8.5 and Skindex-16 total score of 56.8 ± 27.5 (Table 3).
Feasibility and ceiling
There were no missing responses across the 3L or 5L descriptive systems; however, one patient left the EQ VAS blank. A total of 33 different health states were reported on the 3L and 84 on the 5L.
The frequencies and percentages of patients reporting a ceiling are presented in Table 4. A statistically significant reduction in ceiling effect between 5L and 3L was observed in the mobility (4.6%), self-care (11.5%) and usual activities (9.2%) dimensions, while in the anxiety/depression dimension the ceiling slightly increased (2.3%), although the difference between 3L and 5L was insignificant. The largest relative ceiling effect reduction was found for usual activities (17.2%), followed by self-care (13.2%) and pain/discomfort (12.8%). The proportion of patients reporting no problems in each dimension (11111) demonstrated a reduction from 27.5% on the 3L to 22.5% on the 5L (p = 0.029). There were a total of 6 (2.8%) ‘best health you can imagine’ (= 100) responses on the EQ VAS.
Agreement
The distribution of 3L and 5L index scores is shown in Fig. 1. One patient had a negative index score on the 5L, while no negative values were observed on the 3L. The mean 5L index score was lower than that of the 3L, although the difference was insignificant (0.82 ± 0.22 vs. 0.85 ± 0.15, p = 0.928). An overall good agreement was observed between the two measures with an ICC of 0.815 (95% CI 0.758–0.859, p < 0.001). This was confirmed by the Bland–Altman plot in Fig. 2. The differences between 3L and 5L index scores tended to be higher for more severe health states.
Redistribution properties and inconsistencies
The percentages of consistent and inconsistent response pairs in each level of 3L and 5L are shown in Table 5. A total of 64 (5.9%) inconsistent response pairs were reported by 50 (22.9%) patients. The average size of inconsistency was very small (1.09). The highest proportion of inconsistent response pairs (9.2%) and largest average inconsistency (1.15) were present in the anxiety/depression dimension (Table 4). The fewest inconsistent responses occurred in the mobility dimension (1.4%) with an average size of 1.00.
Informativity
The informativity results are provided in Table 4. The 5L increased the absolute (H′) informativity across all dimensions (3L 0.53–1.27 vs. 5L 0.81–1.98) suggesting the usefulness of the two additional response levels in the 5L. Relative informativity (J′) increased for the first four dimensions (3L 0.33–0.74 vs. 5L 0.35–0.85), but not for the anxiety/depression (3L 0.80 vs. 5L 0.73).
Convergent validity
Table 6 shows the correlations between EQ-5D dimensions and index scores with other instruments and scales. The results provide support for most of our hypotheses. The EQ-5D mobility and self-care dimensions showed weak or no correlations with other measures. The usual activities, pain/discomfort and anxiety/depression dimensions were moderately or strongly correlated with the DLQI and Skindex-16 subscale and total scores (rs = 0.429–0.670). The only exception was the symptoms subscale of Skindex-16 that correlated weakly with anxiety/depression. As expected, the itching experienced in the past 1 month exhibited the strongest correlation with the pain/discomfort dimension (3L 0.351 vs. 5L 0.476). Similarly, sleep VAS score for the past 1 month correlated moderately with pain/discomfort (3L 0.381 vs. 5L 0.484). Both itching and sleep VAS showed weak correlations with 3L and moderate correlations with 5L index scores.
Moderate or strong correlations were detected between the EQ-5D index scores and DLQI and Skindex-16 total scores (rs = − 0.731 to − 0.622). Both the 3L and 5L index scores produced strong correlations with the EQ VAS (rs = 0.626 vs. 0.665). Contrary to our hypotheses, weak correlations were observed between index scores and disease severity measured by IGA, oSCORAD, and EASI (rs = − 0.359 to − 0.274). PtGA VAS scores showed moderate correlations with 3L and 5L index scores (rs = − 0.531 vs. − 0.583). With very few exceptions, the 5L demonstrated stronger correlations with all instruments and scales. The difference between the 3L and 5L was particularly pronounced for the pain/discomfort dimension.
Known-group validity
Results on known-group validity analyses are presented in Table 7. Both the 3L and 5L were able to distinguish across predefined groups of patients based on severity and skin-specific HRQoL (i.e. DLQI score bands) with moderate to large effect sizes (0.080–0.489). Patients with more severe disease and worse skin-specific HRQoL had lower EQ-5D index scores (p < 0.001). The 5L more efficiently discriminated across EASI (RE 1.033) and DLQI groups (RE 1.275), while the 3L slightly outperformed 5L in the case of IGA (RE 0.978) and oSCORAD groups (RE = 0.966).
Discussion
This is the first study to compare the psychometric properties of the two adult versions of the EQ-5D in patients with AD. Both the 3L and 5L exhibited overall good psychometric properties in AD; however, the 5L was superior in terms of ceiling effects, informativity and convergent validity. Previously, similar head-to-head 3L vs. 5L comparative studies were carried out in two other chronic inflammatory dermatological conditions, psoriasis and hidradentitis suppurativa also in Hungary that allow for direct comparisons. In line with these prior studies of similar sample size, the 5L resulted in a much richer set of responses with more than twice as many unique health state profiles (psoriasis 86 vs. 30 [27], hidradenitis suppurativa 101 vs. 43 [28], and AD 84 vs. 33). Further similarities across these studies include a substantial relative ceiling effect reduction with the 5L (psoriasis 11.4%, hidradenitis suppurativa 14.6% and AD 18.3%), the low proportion of inconsistent response pairs (psoriasis 3.9%, hidradenitis suppurativa 8.0% and AD 5.9%) and an identical or improved average relative informativity of the 5L (psoriasis 0.61, hidradenitis suppurativa 0.74 and AD 0.64). It seems therefore that the two extra levels of the 5L are effectively used in AD similarly to other chronic dermatological diseases and enable patients to more commonly report health problems.
The improved measurement properties of the 5L descriptive system appear to be translated to the level of utilities as 5L index scores showed stronger correlations with disease severity and skin-specific HRQoL measures in AD in comparison with the 3L. The exceptionally strong correlations of the 5L index scores with Skindex-16 (rs = − 0.684) and DLQI (rs = − 0.731) total scores lend supportive evidence to the excellent validity of the 5L in this patient population. However, validity between known disease severity and skin-specific HRQoL groups was established for both 3L and 5L with negligible difference in effect sizes between the two measures.
Several findings of the present study may be explained by the slightly different wording used in the 3L questionnaire compared to the 5L. Some of these changes affect all language versions (e.g. most severe level of mobility is ‘confined to bed’ in the 3L and ‘unable to walk’ in the 5L) and there are a few variations used solely in the Hungarian versions (e.g. the descriptor ‘anxiety/depression’ in the 5L is ‘anxiety/feeling down’ in the 3L) [29]. This latter modification seems to be responsible for the unexpected psychometric properties of the anxiety/depression dimension, including an increase in ceiling effect (3L 45.4% vs. 5L 47.7%), lower relative informativity of the 5L (3L 0.80 vs. 5L 0.73) and the highest rate of inconsistent response pairs in anxiety/depression (9.2%) among the five dimensions. Similar psychometric properties of the anxiety/depression dimension were reported by other studies from Hungary [27, 28, 57].
Itching is considered a hallmark symptom of AD that may adversely affect patients’ HRQoL, including sleep. It is currently debated to what extent the EQ-5D descriptive system is able to capture itching and sleep problems. A recent study with AD patients found very weak and insignificant correlation between 3L index scores and sleep disturbance as measured on weekly average scores of an 11-point numeric rating scale [58]. In another study with burn patients, the pain/discomfort dimension of the 5L showed moderate correlations with a 10-point itching VAS [53]. Recent qualitative evidence in psoriasis patients also suggests that the discomfort element of the pain/discomfort composite dimension may cover itching to a minor extent [59]. Our findings in AD showed a weak correlation for the 3L and a moderate for the 5L pain/discomfort dimension and index scores with itching and sleep problems. However, a 1-month recall period was used for the itching and sleep VAS, whereas the EQ-5D asks about ‘today’. These results are also relevant for the currently expanding bolt-on research programme for the EQ-5D. Over the past three decades, several additional dimensions (bolt-ons) have been developed for the EQ-5D to improve accuracy and precision of the measure in specific populations [60]. Among them, there is a bolt-on aiming to assess sleep problems and another psoriasis-specific bolt-on with two items, one of which, skin irritation measures the level of itching experienced by the respondent [61, 62].
Another noteworthy finding of this study is that mean index scores were lower in the 5L (5L 0.82 vs. 3L 0.85). As the Hungarian 3L and 5L value sets were developed in a parallel valuation study from a common sample, using the same preference elicitation method and modelling approach, the differences found in index scores reflect the wording differences between the two measures. The difference in mean 3L and 5L index scores was smaller at the top end of the scale near ‘full health’ and there was a widening gap at lower mean index scores. For example, patients with severe AD according to their oSCORAD score had considerably higher mean 3L index score than that in the 5L (0.80 vs. 0.76). In contrast, the difference was much smaller in either the ‘clear’ (0.98 vs. 0.97) or ‘mild’ (0.90 vs 0.91) oSCORAD groups. As a result, an assumed improvement from ‘severe’ to ‘clear’ skin may lead to a mean index score gain of 0.18 with the 3L and 0.21 with the 5L that might guarantee a lower cost-effectiveness ratio with the 5L for the same AD treatment.
The main strengths of the present study include the multicentre design, the diverse patient population in terms of sociodemographic and clinical background and the wide range of validated skin-specific HRQoL instruments and disease severity scales used. Potential limitations include the cross-sectional design that did not allow to assess test–retest reliability and responsiveness of the instruments. Furthermore, most patients were recruited at university clinics, where patients with mild disease may be underrepresented. Lastly, albeit the DLQI and Skindex-16 have been extensively validated in AD patients and are the most widely used HRQoL questionnaires in dermatological conditions [63, 64], these are skin-specific and not condition-specific instruments and their adequacy has been subject to criticism [65,66,67,68,69]. Further research may concentrate on the validation of the EQ-5D against AD-specific HRQoL measures, such as the Quality of Life Index for Atopic Dermatitis (QoLIAD) [70].
In summary, both the 3L and 5L showed an overall good validity in adult AD patients. The superiority of the 5L was confirmed in many aspects, including ceiling effect, informativity and convergent validity. Given the high prevalence and considerable societal burden of AD, our findings fill in an important gap in evidence needed when selecting instruments for economic evaluations. Such analyses have become particularly important with the increasing number of costly new therapies for AD, including biological and small molecule treatments [71,72,73]. Based on our findings, we recommend the use of the 5L to measure health outcomes both in clinical settings and for QALY calculations in adult AD.
References
Langan, S.M., Irvine, A.D., Weidinger, S.: Atopic dermatitis. Lancet (2020). https://doi.org/10.1016/s0140-6736(20)31286-1
Weidinger, S., Beck, L.A., Bieber, T., Kabashima, K., Irvine, A.D.: Atopic dermatitis. Nat. Rev. Dis. Primers (2018). https://doi.org/10.1038/s41572-018-0001-z
Hill, M.K., Kheirandish, P.A., Braunberger, T.L., Armstrong, A.W., Dunnick, C.A.: Recent trends in disease severity and quality of life instruments for patients with atopic dermatitis: a systematic review. J. Am. Acad. Dermatol. (2016). https://doi.org/10.1016/j.jaad.2016.07.002
Birdi, G., Cooke, R., Knibb, R.C.: Impact of atopic dermatitis on quality of life in adults: a systematic review and meta-analysis. Int. J. Dermatol. (2020). https://doi.org/10.1111/ijd.14763
Gabes, M., Tischer, C., Apfelbacher, C.: Measurement properties of quality-of-life outcome measures for children and adults with eczema: an updated systematic review. Pediatr. Allergy Immunol. (2020). https://doi.org/10.1111/pai.13120
Koszorú, K., Borza, J., Gulácsi, L., Sárdy, M.: Quality of life in patients with atopic dermatitis. Cutis 104, 174–177 (2019)
Wollenberg, A., Barbarot, S., Bieber, T., Christen-Zaech, S., Deleuran, M., Fink-Wagner, A., et al.: Consensus-based European guidelines for treatment of atopic eczema (atopic dermatitis) in adults and children: part I. J. Eur. Acad. Dermatol. Venereol. (2018). https://doi.org/10.1111/jdv.14891
Wollenberg, A., Barbarot, S., Bieber, T., Christen-Zaech, S., Deleuran, M., Fink-Wagner, A., et al.: Consensus-based European guidelines for treatment of atopic eczema (atopic dermatitis) in adults and children: part II. J. Eur. Acad. Dermatol. Venereol. (2018). https://doi.org/10.1111/jdv.14888
Bieber, T.: Atopic dermatitis: an expanding therapeutic pipeline for a complex disease. Nat. Rev. Drug Discov. (2021). https://doi.org/10.1038/s41573-021-00266-6
Ariëns, L.F.M., van Nimwegen, K.J.M., Shams, M., de Bruin, D.T., van der Schaft, J., van Os-Medendorp, H., et al.: Economic burden of adult patients with moderate to severe atopic dermatitis indicated for systemic treatment. Acta Derm. Venereol. (2019). https://doi.org/10.2340/00015555-3212
Smith, B.W., Chovatiya, R., Thibau, I.J., Silverberg, J.I.: Financial burden of atopic dermatitis out-of-pocket health care expenses in the United States. Dermatitis (2021). https://doi.org/10.1097/der.0000000000000715
Drucker, A.M., Wang, A.R., Li, W.Q., Sevetson, E., Block, J.K., Qureshi, A.A.: The burden of atopic dermatitis: summary of a report for the National Eczema Association. J. Invest. Dermatol. (2017). https://doi.org/10.1016/j.jid.2016.07.012
Wu, A.C., Fuhlbrigge, A.L., Robayo, M.A., Shaker, M.: Cost-effectiveness of biologics for allergic diseases. J. Allergy Clin. Immunol. Pract. (2021). https://doi.org/10.1016/j.jaip.2020.10.009
Finch, A.P., Brazier, J.E., Mukuria, C.: What is the evidence for the performance of generic preference-based measures? A systematic overview of reviews. Eur. J. Health Econ. (2018). https://doi.org/10.1007/s10198-017-0902-x
Richardson, J., McKie, J., Bariola, E.: Multiattribute utility instruments and their use. In: Culyer, A.J. (ed.) Encyclopedia of health economics, pp. 341–357. Elsevier, San Diego (2014)
Wang, A., Rand, K., Yang, Z., Brooks, R., Busschbach, J.: The remarkably frequent use of EQ-5D in non-economic research. Eur. J. Health Econ. (2021). https://doi.org/10.1007/s10198-021-01411-z
Rencz, F., Gulácsi, L., Drummond, M., Golicki, D., Prevolnik, R.V., Simon, J., et al.: EQ-5D in Central and Eastern Europe: 2000–2015. Qual. Life Res. (2016). https://doi.org/10.1007/s11136-016-1375-6
Kennedy-Martin, M., Slaap, B., Herdman, M., van Reenen, M., Kennedy-Martin, T., Greiner, W., et al.: Which multi-attribute utility instruments are recommended for use in cost-utility analysis? A review of national health technology assessment (HTA) guidelines. Eur. J. Health Econ. (2020). https://doi.org/10.1007/s10198-020-01195-8
EuroQol Group: EuroQol–a new facility for the measurement of health-related quality of life. Health Policy (1990). https://doi.org/10.1016/0168-8510(90)90421-9
Herdman, M., Gudex, C., Lloyd, A., Janssen, M., Kind, P., Parkin, D., et al.: Development and preliminary testing of the new five-level version of EQ-5D (EQ-5D-5L). Qual. Life Res. (2011). https://doi.org/10.1007/s11136-011-9903-x
Yang, Y., Brazier, J., Longworth, L.: EQ-5D in skin conditions: an assessment of validity and responsiveness. Eur. J. Health Econ. (2015). https://doi.org/10.1007/s10198-014-0638-9
Pereira, F.R., Basra, M.K., Finlay, A.Y., Salek, M.S.: The role of the EQ-5D in the economic evaluation of dermatological conditions and therapies. Dermatology (2012). https://doi.org/10.1159/000339865
Tamási, B., Brodszky, V., Péntek, M., Gulácsi, L., Hajdu, K., Sárdy, M., et al.: Validity of the EQ-5D in patients with pemphigus vulgaris and pemphigus foliaceus. Br. J. Dermatol. (2019). https://doi.org/10.1111/bjd.16883
Gergely, L.H., Gáspár, K., Brodszky, V., Kinyó, Á., Szegedi, A., Remenyik, É., et al.: Validity of EQ-5D-5L, Skindex-16, DLQI and DLQI-R in patients with hidradenitis suppurativa. J. Eur. Acad. Dermatol. Venereol. (2020). https://doi.org/10.1111/jdv.16642
Buchholz, I., Janssen, M.F., Kohlmann, T., Feng, Y.S.: A systematic review of studies comparing the measurement properties of the three-level and five-level versions of the EQ-5D. Pharmacoeconomics (2018). https://doi.org/10.1007/s40273-018-0642-5
Janssen, M.F., Bonsel, G.J., Luo, N.: Is EQ-5D-5L better than EQ-5D-3L? A head-to-head comparison of descriptive systems and value sets from seven countries. Pharmacoeconomics (2018). https://doi.org/10.1007/s40273-018-0623-8
Poór, A.K., Rencz, F., Brodszky, V., Gulácsi, L., Beretzky, Z., Hidvégi, B., et al.: Measurement properties of the EQ-5D-5L compared to the EQ-5D-3L in psoriasis patients. Qual. Life Res. (2017). https://doi.org/10.1007/s11136-017-1699-x
Bató, A., Brodszky, V., Gergely, L.H., Gáspár, K., Wikonkál, N., Kinyó, Á., et al.: The measurement performance of the EQ-5D-5L versus EQ-5D-3L in patients with hidradenitis suppurativa. Qual. Life Res. (2021). https://doi.org/10.1007/s11136-020-02732-x
Rencz, F., Brodszky, V., Gulácsi, L., Golicki, D., Ruzsa, G., Pickard, A.S., et al.: Parallel valuation of the EQ-5D-3L and EQ-5D-5L by time trade-off in Hungary. Value Health (2020). https://doi.org/10.1016/j.jval.2020.03.019
Finlay, A.Y., Khan, G.K.: Dermatology Life Quality Index (DLQI)—a simple practical measure for routine clinical use. Clin. Exp. Dermatol. (1994). https://doi.org/10.1111/j.1365-2230.1994.tb01167.x
Chren, M.M., Lasek, R.J., Sahay, A.P., Sands, L.P.: Measurement properties of Skindex-16: a brief quality-of-life measure for patients with skin diseases. J. Cutan. Med. Surg. (2001). https://doi.org/10.1007/bf02737863
Janssen, M.F., Birnie, E., Haagsma, J.A., Bonsel, G.J.: Comparing the standard EQ-5D three-level system with a five-level version. Value Health (2008). https://doi.org/10.1111/j.1524-4733.2007.00230.x
Eichenfield, L.F., Lucky, A.W., Boguniewicz, M., Langley, R.G., Cherill, R., Marshall, K., et al.: Safety and efficacy of pimecrolimus (ASM 981) cream 1% in the treatment of mild and moderate atopic dermatitis in children and adolescents. J. Am. Acad. Dermatol. (2002). https://doi.org/10.1067/mjd.2002.122187
Kunz, B., Oranje, A.P., Labrèze, L., Stalder, J.F., Ring, J., Taïeb, A.: Clinical validation and guidelines for the SCORAD index: consensus report of the European Task Force on Atopic Dermatitis. Dermatology (1997). https://doi.org/10.1159/000245677
Hanifin, J.M., Thurston, M., Omoto, M., Cherill, R., Tofte, S.J., Graeber, M.: The eczema area and severity index (EASI): assessment of reliability in atopic dermatitis. EASI Evaluator Group. Exp. Dermatol. (2001). https://doi.org/10.1034/j.1600-0625.2001.100102.x
Schmitt, J., Spuls, P.I., Thomas, K.S., Simpson, E., Furue, M., Deckert, S., et al.: The Harmonising Outcome Measures for Eczema (HOME) statement to assess clinical signs of atopic eczema in trials. J. Allergy Clin. Immunol. (2014). https://doi.org/10.1016/j.jaci.2014.07.043
Schmitt, J., Langan, S., Deckert, S., Svensson, A., von Kobyletzki, L., Thomas, K., et al.: Assessment of clinical signs of atopic dermatitis: a systematic review and recommendation. J. Allergy Clin. Immunol. (2013). https://doi.org/10.1016/j.jaci.2013.07.008
Rehal, B., Armstrong, A.W.: Health outcome measures in atopic dermatitis: a systematic review of trends in disease severity and quality-of-life instruments 1985–2010. PLoS ONE (2011). https://doi.org/10.1371/journal.pone.0017520
Chopra, R., Vakharia, P.P., Sacotte, R., Patel, N., Immaneni, S., White, T., et al.: Severity strata for Eczema Area and Severity Index (EASI), modified EASI, Scoring Atopic Dermatitis (SCORAD), objective SCORAD, Atopic Dermatitis Severity Index and body surface area in adolescents and adults with atopic dermatitis. Br. J. Dermatol. (2017). https://doi.org/10.1111/bjd.15641
Hongbo, Y., Thomas, C.L., Harrison, M.A., Salek, M.S., Finlay, A.Y.: Translating the science of quality of life into practice: what do dermatology life quality index scores mean? J. Invest. Dermatol. (2005). https://doi.org/10.1111/j.0022-202X.2005.23621.x
Rencz, F., Gergely, L.H., Wikonkál, N., Gáspár, K., Péntek, M., Gulácsi, L., et al.: Dermatology Life Quality Index (DLQI) score bands are applicable to DLQI-Relevant (DLQI-R) scoring. J. Eur. Acad. Dermatol. Venereol. (2020). https://doi.org/10.1111/jdv.16398
Janssen, M.F., Pickard, A.S., Golicki, D., Gudex, C., Niewada, M., Scalone, L., et al.: Measurement properties of the EQ-5D-5L compared to the EQ-5D-3L across eight patient groups: a multi-country study. Qual. Life Res. (2013). https://doi.org/10.1007/s11136-012-0322-4
Bland, J.M., Altman, D.G.: Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1, 307–310 (1986)
Koo, T.K., Li, M.Y.: A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J. Chiropr. Med. (2016). https://doi.org/10.1016/j.jcm.2016.02.012
Shrout, P.E., Fleiss, J.L.: Intraclass correlations: uses in assessing rater reliability. Psychol. Bull. (1979). https://doi.org/10.1037//0033-2909.86.2.420
Cicchetti, D.: Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instrument in psychology. Psychol. Assess. (1994). https://doi.org/10.1037/1040-3590.6.4.284
Shannon, C.E.: A mathematical theory of communication. Bell Syst. Tech. J. (1948). https://doi.org/10.1002/j.1538-7305.1948.tb01338.x
Shannon, C.E., Weaver, W.: The mathematical theory of communication. The University of Illinois Press, Urbana (1949)
Bas Janssen, M.F., Birnie, E., Bonsel, G.J.: Evaluating the discriminatory power of EQ-5D, HUI2 and HUI3 in a US general population survey using Shannon’s indices. Qual. Life Res. (2007). https://doi.org/10.1007/s11136-006-9160-6
Vilsbøll, A.W., Kragh, N., Hahn-Pedersen, J., Jensen, C.E.: Mapping Dermatology Life Quality Index (DLQI) scores to EQ-5D utility scores using data of patients with atopic dermatitis from the National Health and Wellness Study. Qual. Life Res. (2020). https://doi.org/10.1007/s11136-020-02499-1
Hsieh, B.J., Shen, D., Hsu, C.J., Chan, T.C., Cho, Y.T., Tang, C.H., et al.: The impact of atopic dermatitis on health-related quality of life in Taiwan. J. Formos. Med. Assoc. (2021). https://doi.org/10.1016/j.jfma.2021.03.024
Shah, K.K., Mulhern, B., Longworth, L., Janssen, M.F.: Views of the UK general public on important aspects of health not captured by EQ-5D. Patient (2017). https://doi.org/10.1007/s40271-017-0240-1
Spronk, I., Bonsel, G.J., Polinder, S., van Baar, M.E., Janssen, M.F., Haagsma, J.A.: Exploring the relation between the EQ-5D-5L pain/discomfort and pain and itching in a sample of burn patients. Health Qual. Life Outcomes (2020). https://doi.org/10.1186/s12955-020-01394-0
Evans, J.D.: Straightforward statistics for the behavioral sciences. Brooks/Cole Publishing, Pacific Grove (1996)
Cohen, J.: Statistical power analysis for the behavioral sciences. Routledge, Abingdon (1988)
R Core Team (2021). R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/. Accessed 2 Nov 2021
Rencz, F., Lakatos, P.L., Gulácsi, L., Brodszky, V., Kürti, Z., Lovas, S., et al.: Validity of the EQ-5D-5L and EQ-5D-3L in patients with Crohn’s disease. Qual. Life Res. (2019). https://doi.org/10.1007/s11136-018-2003-4
Puelles, J., Fofana, F., Rodriguez, D., Silverberg, J.I., Wollenberg, A., Dias, B.C., et al.: Psychometric validation and responder definition of the sleep disturbance numerical rating scale in moderate-to-severe atopic dermatitis. Br. J. Dermatol. (2021). https://doi.org/10.1111/bjd.20783
Rencz, F., Mukuria, C., Bató, A., Poór, A.K., Finch, A.P.: A qualitative investigation of the relevance of skin irritation and self-confidence bolt-ons and their conceptual overlap with the EQ-5D in patients with psoriasis. Qual Life Res (2022). [In press]
Geraerds, A., Bonsel, G.J., Janssen, M.F., Finch, A.P., Polinder, S., Haagsma, J.A.: Methods used to identify, test, and assess impact on preferences of bolt-ons: a systematic review. Value Health (2021). https://doi.org/10.1016/j.jval.2020.12.011
Yang, Y., Rowen, D., Brazier, J., Tsuchiya, A., Young, T., Longworth, L.: An exploratory study to test the impact on three “bolt-on” items to the EQ-5D. Value Health (2015). https://doi.org/10.1016/j.jval.2014.09.004
Swinburn, P., Lloyd, A., Boye, K.S., Edson-Heredia, E., Bowman, L., Janssen, B.: Development of a disease-specific version of the EQ-5D-5L for use in patients suffering from psoriasis: lessons learned from a feasibility study in the UK. Value Health (2013). https://doi.org/10.1016/j.jval.2013.10.003
Higaki, Y., Kawamoto, K., Kamo, T., Ueda, S., Arikawa, J., Kawashima, M.: Measurement of the impact of atopic dermatitis on patients’ quality of life: a cross-sectional and longitudinal questionnaire study using the Japanese version of Skindex-16. J. Dermatol. (2004). https://doi.org/10.1111/j.1346-8138.2004.tb00640.x
Zachariae, R., Zachariae, C., Ibsen, H., Mortensen, J.T., Wulf, H.C.: Dermatology life quality index: data from Danish inpatients and outpatients. Acta Derm. Venereol. (2000). https://doi.org/10.1080/000155500750012153
Twiss, J., Meads, D.M., Preston, E.P., Crawford, S.R., McKenna, S.P.: Can we rely on the Dermatology Life Quality Index as a measure of the impact of psoriasis or atopic dermatitis? J. Invest. Dermatol. (2012). https://doi.org/10.1038/jid.2011.238
Rencz, F., Poór, A.K., Péntek, M., Holló, P., Kárpáti, S., Gulácsi, L., et al.: A detailed analysis of ‘not relevant’ responses on the DLQI in psoriasis: potential biases in treatment decisions. J. Eur. Acad. Dermatol. Venereol. (2018). https://doi.org/10.1111/jdv.14676
Rencz, F., Gulácsi, L., Péntek, M., Poór, A.K., Sárdy, M., Holló, P., et al.: Proposal of a new scoring formula for the Dermatology Life Quality Index in psoriasis. Br. J. Dermatol. (2018). https://doi.org/10.1111/bjd.16927
Rencz, F., Mitev, A.Z., Szabó, Á., Beretzky, Z., Poór, A.K., Holló, P., et al.: A Rasch model analysis of two interpretations of ‘not relevant’ responses on the Dermatology Life Quality Index (DLQI). Qual. Life Res. (2021). https://doi.org/10.1007/s11136-021-02803-7
Rencz, F., Brodszky, V., Gulácsi, L., Péntek, M., Poór, A.K., Holló, P., et al.: Time to revise the Dermatology Life Quality Index scoring in psoriasis treatment guidelines. J. Eur. Acad. Dermatol. Venereol. (2019). https://doi.org/10.1111/jdv.15537
Whalley, D., McKenna, S.P., Dewar, A.L., Erdman, R.A., Kohlmann, T., Niero, M., et al.: A new instrument for assessing quality of life in atopic dermatitis: international development of the Quality of Life Index for Atopic Dermatitis (QoLIAD). Br. J. Dermatol. (2004). https://doi.org/10.1111/j.1365-2133.2004.05783.x
Kuznik, A., Bégo-Le-Bagousse, G., Eckert, L., Gadkari, A., Simpson, E., Graham, C.N., et al.: Economic evaluation of dupilumab for the treatment of moderate-to-severe atopic dermatitis in adults. Dermatol. Ther. (Heidelb). (2017). https://doi.org/10.1007/s13555-017-0201-6
Calabrese, L., Malvaso, D., Chiricozzi, A., Tambone, S., D’Urso, D.F., Guerriero, C., et al.: Baricitinib: therapeutic potential for moderate to severe atopic dermatitis. Expert Opin. Investig. Drugs (2020). https://doi.org/10.1080/13543784.2020.1800639
Zimmermann, M., Rind, D., Chapman, R., Kumar, V., Kahn, S., Carlson, J.: Economic evaluation of dupilumab for moderate-to-severe atopic dermatitis: a cost-utility analysis. J. Drugs Dermatol. 17, 750–756 (2018)
Acknowledgements
The authors are thankful to Györgyi Pónyai, Júlia Borza, Katalin Bodai, Kincső Blága, László Gulácsi and Márta Péntek for their help in the patient recruitment or questionnaire design.
Funding
Open access funding provided by Corvinus University of Budapest. Zsuzsanna Beretzky’s work was supported by the Higher Education Institutional Excellence Program 2020 of the Ministry of Innovation and Technology in the framework of the Financial and Public Services research project (TKP2020-IKA-02) at the Corvinus University of Budapest. Fanni Rencz’s work was supported by the János Bolyai Research Scholarship of the Hungarian Academy of Sciences (BO/00304/21) and the New National Excellence Program of the Ministry for Innovation and Technology from the source of the National Research, Development and Innovation Fund (ÚNKP-21-5-CORVINUS-134).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
Anikó Kovács has received consultancy fees and honoraria for lecturing from Abbvie and Sanofi Genzyme and support for attending meetings and/or travel from Sanofi Genzyme. Miklós Sárdy has received consultancy fees and honoraria for lecturing from Lilly and support for attending meetings including travel and participation on an advisory board from Janssen, Novartis, Egis and Sanofi. Fanni Rencz is a member of the EuroQol Group. Views expressed in the article are those of the authors and are not necessarily those of the EuroQol Research Foundation.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Koszorú, K., Hajdu, K., Brodszky, V. et al. Comparing the psychometric properties of the EQ-5D-3L and EQ-5D-5L descriptive systems and utilities in atopic dermatitis. Eur J Health Econ 24, 139–152 (2023). https://doi.org/10.1007/s10198-022-01460-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10198-022-01460-y