Abstract
Background
Rare diseases negatively impact patients’ quality of life, but the estimation of health state utility values (HSUVs) in research studies and cost–utility models for health technology assessment is challenging.
Objectives
This study compared the methods for estimating the HSUVs included in manufacturers’ submissions of orphan drugs to the National Institute for Health and Care Excellence (NICE) with those of published studies addressing the same rare diseases to understand whether manufacturers fully exploited the existing literature in developing their economic models.
Methods
All NICE Technology Appraisal (TA) and Highly Specialized Technologies (HST) guidance documents of non-cancer European Medicines Agency (EMA) orphan medicinal products were reviewed and compared with any published primary studies, retrieved via PubMed until November 2020, and estimating HSUVs for the same conditions addressed in manufacturers’ submissions.
Results
We identified 22 NICE TA/HST appraisal reports addressing 19 different rare diseases. Sixteen reports presented original HSUVs estimated using EQ-5D or Health Utility Index (n = 12), direct methods (n = 2) or mapping (n = 2), while the other six included values obtained from the literature only. In parallel, we identified 111 published studies: 86.6% used preference-based measures (mainly EQ-5D, 60.7%), 12.5% direct techniques, and 2.7% mapping. The collection of values from non-patient populations (using ‘vignettes’) was more frequent in manufacturers’ submissions than in the literature (22.7% vs. 8.0%).
Conclusions
The agreement on methodological choices between manufacturers’ submissions and published literature was only partial. More efforts should be made by manufacturers to accurately reflect the academic literature and its methodological recommendations in orphan drugs submissions.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Background
Rare diseases (RDs) are defined by the European Medicines Agency (EMA) as diseases with a prevalence of fewer than 5 in 10,000 people and affect around 30 million individuals in the European Union (https://ec.europa.eu). These diseases are often severe, life-threatening and/or disabling, characterized by an early onset. As such, they negatively affect patients’ and their carers’ quality of life (QoL) [1], which should be appropriately account for when assessing the benefit of a new medicine. This is usually done via consideration of patient-reported outcome (PRO) data and/or health state utility values (HSUVs).
According to the Food and Drug Administration (US), PROs are defined as ‘any report of the status of a patient’s health condition that comes directly from the patient, without interpretation of the patient’s response by a clinician or anyone else’, and a patient-reported outcome measure (PROM) as an instrument, usually a questionnaire, that captures PRO data from patients (or proxies) to measure disease impact or drug effects, in a clinical trial or other study [2, 3]. For some PROMs, known as preference-based measures, an algorithm has been developed that converts patient responses into HSUVs based on preferences for specific health states derived from the instrument’s dimensions. HSUVs are single numerical values expressing preference-weighted QoL for particular health states and scored on a scale ranging from 0 (for a state equivalent to ‘dead’) to 1 (for a state equivalent to ‘full health’) [4], although some instruments provide also negative values for health states considered worse than death [5]. Among their applications, they are used for calculating the quality-adjusted life year (QALY), which combines HSUVs and survival data in a single metric and is one of the most preferred outcome measures for health technology assessment (HTA) [4].
A number of methods have been developed to estimate HSUVs, including direct and indirect approaches (i.e., preference-based PROMs). The most common direct techniques include standard gamble (SG) and time trade-off (TTO), where respondents are asked to choose between life with a lower QoL, and life in ‘full health’ with a risk of immediate death (in SG) or a shorter length (in TTO) [6]. A simple approach is represented by the visual analogue scale (VAS), where individuals are asked to rate health by selecting a value on scale where 0 is the worst state they can imagine and 10 or 100 is the best [4]. In recent years, discrete choice experiments (DCEs) have become frequently used to generate HSUVs by asking individuals to choose between hypothetical health states to elicit their preferred health state and the relative weights for various attributes included within health states [7]. The use of an indirect approach through preference-based PROMs is increasingly common and encouraged by many HTA bodies. Six main preference-based generic measures for use in HTA were identified by a recent review [8]: the EuroQol 5-Dimension (EQ-5D), the Health Utility Index Mark 2 or Mark 3 (HUI2/3), the Short Form 6 Dimension (SF-6D), the Quality of Well-Being (QWB) scale, the 15D, and the Assessment of Quality of life (AQoL). In the United Kingdom, the National Institute for Health and Care Excellence (NICE) recommends the use of EQ-5D converted to QALYs to measure the added benefit of new drugs [9]. In detail, companies, academic groups and others preparing evidence submissions for NICE should use the EQ-5D-3L value set for reference-case analyses. If data were gathered using the more recent EQ‑5D‑5L descriptive system, utility values should be calculated by mapping the 5L data onto the 3L value set, until a new high-quality 5L value set for England becomes available [10]. A further approach that is gaining consensus in research and HTA, also accepted by NICE in the absence of EQ-5D data [11], is ‘mapping’ from non-preference-based measures onto preference-based ones (e.g., EQ-5D) using an algorithm previously developed.
The estimation of HSUVs in RDs, however, is challenging. This is because direct techniques may be too demanding for patients, who are often young children or mentally disabled, while generic preference-based instruments may not capture all the relevant symptoms and disabilities, and ‘vignettes’ that describe standard hypothetical health states to non-patient populations may not reflect the heterogeneous manifestations of these diseases [12]. Similarly, the use of mapping in RDs is characterized by difficulties in recruiting sufficiently large samples needed to conduct the regression analysis to derive the algorithm, the limited overlap between the domains included in the disease-specific and generic PROMs, and the poor applicability of algorithms developed for non-rare conditions [13]. Disease-specific preference-based PROMs, which are more sensitive measures, exist for only few RDs (e.g., the Amyotrophic Lateral Sclerosis Utility Index, ALSUI) [14]. Caregiver QoL is increasingly considered in HTA, and several carer-specific preference-based instruments have been developed (e.g., ASCOT-Carer and CarerQol-7D) [15, 16]. However, their use and consideration in RDs remain limited [17, 18], even though most diseases affect young children impacting the QoL of those caring for them.
Work Package 10 of the EU Horizon 2020 project IMPACT-HTA (www.impact-hta.eu) aimed to investigate and develop guidance on use of PRO data and HSUVs in RD treatments for HTA. Results suggested PROs and HSUVs sometimes fail to demonstrate change in symptomatology or capture dimensions that really matter to patients, and their estimates are often uncertain and/or of poor quality due to insufficient evidence. The research also pointed to the likely impact of the nature of RDs on the methodological limitations identified, e.g., collecting PROs from paediatric, cognitively impaired, or heterogeneous populations [12, 19].
When preparing their HTA submissions, manufacturers usually review the available literature to identify and derive the parameters for their economic evaluations and/or to learn about methods, such as existing techniques to estimate HSUVs. Given the poor quality of the QoL evidence included in many HTA submissions [18], the question is whether manufacturers are making full use of the available literature, or whether the poor quality might be a consequence of the intrinsic limitations in applying the existing approaches to PRO and HSUV data generation in RDs. In order to answer this question, this paper aimed to further examine the approaches used to derive the HSUVs used in HTA submissions and compare them with the corresponding methodologies and recommendations arising from the literature. This was done by reviewing and comparing the methods used by manufacturers to derive HSUVs in NICE’s appraisal reports of orphan drugs with all published studies addressing the same RDs to understand whether manufacturers fully exploited the existing literature in developing their economic models and derive some related methodological learnings.
Methods
All treatments with an EMA orphan designation and appraised by NICE within their Technology Appraisal (TA) and Highly Specialized Technologies (HST) programmes until June 2020 were included in the study. Specialized or selected high-cost low-volume treatments for very rare conditions are typically evaluated within the HST procedure, while all other treatments undergo the TA process [20]. We excluded appraisal reports for cancer indications given that QALYs are more likely to be driven by survival rather than QoL increases, and any reports for conditions that are not included in the ORPHANET list of RDs [21].
The publicly available appraisal reports were downloaded from NICE’s website (https://www.nice.org.uk/). These reports include a summary of the evidence submitted by the manufacturer, a review of the manufacturer’s submission by an independent Evidence Review Group (ERG), and the Appraisal Committee’s appraisal of the evidence and final decision. The following information was extracted about the manufacturer’s approach(es) embraced to derive the HSUVs used in the economic model(s): type of technique(s) used, number and type of respondents, and literature sources consulted. Subsequent comments, criticisms, and suggestions made by the ERG and/or Committee relating to the techniques used and HSUVs results were also extracted and summarized in order to gain a better understanding of NICE’s opinion of these approaches or of what they would consider reasonable to obtain HSUVs.
We then performed a scoping review of published studies following the Preferred Reporting Items for Systematic reviews and Meta-Analyses extension for Scoping Reviews (PRISMA-ScR) checklist [22]. The scoping review was considered appropriate to synthesize the literature as this approach is suggested to map evidence and investigate how research is conducted on a certain topic (i.e., in this study, the estimation of HSUVs in rare diseases) [22, 23]. The aim was to retrieve all primary studies estimating HSUVs in the same indications addressed by the sample of NICE reports identified. An example of the research string used in PubMed is reported below for one NICE appraisal assessing Burosumab for X-linked hypophosphatemia (HST8 [24]):
((Burosumab)[Title/Abstract] OR (X-linked hypophosphatemia)[Title/Abstract]) AND ((health state utility values)[Title/Abstract] OR (utility values)[Title/Abstract] OR (health utilities)[Title/Abstract] OR (preference weights)[Title/Abstract] OR (index values)[Title/Abstract] OR QALYs[Title/Abstract] OR (cost-utility)[Title/Abstract] OR EQ-5D[Title/Abstract] OR EuroQol[Title/Abstract] OR HUI[Title/Abstract] OR (Health Utility Index)[Title/Abstract] OR QWB[Title/Abstract] OR SF-6D[Title/Abstract] OR 15D [Title/Abstract]).
The same string was applied to every drug-disease combination addressed by the NICE reports identified; drugs for the same indication were included in a single string (the full search strategy is reported in Supplementary File 1). The electronic searches were conducted until November 2020. The ScHARRHUD database (https://www.scharrhud.org/) holding bibliographic details of studies reporting HSUVs was also searched. All search results were extracted into an Excel spreadsheet and duplicates removed. Titles and abstracts were screened by two independent reviewers and records excluded if they did not meet the inclusion criteria; full-text papers were retrieved in case of doubtful results. Any disagreement was solved by discussion until a consensus was reached.
The studies deemed eligible for inclusion were those presenting an original approach for estimating HSUVs in any of the conditions of interest, i.e., we excluded those using estimates from previous studies already in the literature. Studies addressing multiple conditions (e.g., post lung transplantation patients) were included provided that at least one subsample of observations was related to the condition of interest (e.g., cystic fibrosis). Studies of any design (i.e., cross-sectional surveys, cohort studies, clinical trials, and cost–utility models) could be included in the review, provided that they showed an original approach to deriving HSUVs. Studies in languages other than English, conference abstracts and study protocols were excluded. Literature reviews were excluded but their reference lists were manually checked in order to identify any additional original studies not captured by the online searches. Similarly, economic evaluations using the published literature to obtain utility parameters for their analyses were excluded, but their reference lists were checked to avoid missing any relevant publications.
For each study, we extracted detailed information regarding the study characteristics (i.e., study design, setting, country, type and number of participants), the method(s) adopted to estimate HSUVs (e.g., direct approaches, such as SG or TTO, indirect approaches using preference-based instruments, such as EQ-5D or mapping), actual HSUVs estimates for the study sample and relevant subgroups, and authors’ methodological considerations regarding the approaches used to estimate HSUVs (if reported). The data extraction form (in Excel) was piloted in parallel by two reviewers on a sample of ten studies, and subsequently refined and completed independently by the first author. The study characteristics were tabulated and subsequently summarized through descriptive statistics (i.e., frequency distribution).
Then, the methods used by manufacturers to derive HSUVs, as reported in the NICE documents reviewed, were compared with those used in the published literature for each indication considered. Specifically, we aimed to understand whether manufacturers relied on the published studies (in the public domain at least one year before the NICE appraisal date) to retrieve HSUVs, or to replicate the techniques adopted to derive HSUVs. Otherwise, we sought to understand if divergences from the existing literature was motivated by the limitations highlighted by study authors’ around the technique(s) adopted and the results obtained. In addition, based on these authors’ methodological considerations, we summarized the key learnings that are important to account for when using each of the available methods to derive HSUVs in individual RDs, and whether the manufacturer’s approach could have better reflected any methodological advice in the existing literature.
Results
Synthesis of NICE technology appraisals
Of the 48 appraisal reports identified from the NICE website, we excluded 24 for cancer indications and two additional reports for Crohn’s and cytomegalovirus disease, since these conditions are not included in the ORPHANET list of RDs. The final sample included 22 TA/HST reports [24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45] for 19 different indications (one indication could have several treatments appraised), published between 2012 and 2019 (Table 1). Across these 22 manufacturer submissions, 16 (72.7%) derived their own HSUVs. Of these, 12 presented original data collection using a preference-based instrument (i.e., EQ-5D in 10 reports, HUI2 in one report, and HUI3 together with EQ-5D in another one). In eight of these (HST1, HST10, TA266, TA398, HST2, TA431, TA491, TA379), questionnaires were filled in by patients within a clinical study, while in the remaining four (HST6, HST8, HST11, HST12) questionnaires were administered to clinical experts to value hypothetical health state descriptions (i.e., ‘vignettes’). In two reports (TA276 and HST5), HSUVs were derived by mapping non-preference-based measures (Cystic Fibrosis Questionnaire, CFQ and Short Form-36, SF-36) onto EQ-5D. In one report (HST4), HSUVs were obtained through a discrete choice experiment (DCE), and in another (TA467) through a SG exercise, both performed with the general public. Finally, six reports (HST3, HST7, HST9, TA443, TA588, TA606) did not perform any empirical study, and relied only on literature values and/or expert opinion to obtain HSUVs (Fig. 1).
In total, the use of ‘vignettes’ was reported in five cases (22.7%), of which four (HST6, HST8, HST11, HST12) using EQ-5D or HUI2/3, and one (TA467) SG, and ten reports (HST9, TA266, TA398, HST2, HST7, TA588, TA606, TA443, TA491, HST3) referred to published studies to obtain some or all HSUVs for their analyses. Cumulatively, 15 cases (68.2%) reported EQ-5D utilities obtained with different approaches: seven (HST1, HST2, HST10, TA379, TA398, TA431, TA491) collecting data directly from patients, four (HST6, HST8, HST11, HST12) using ‘vignettes’ to be valued by clinical experts, two (HST9, TA606) referring to published studies, and other two (HST5, TA276) using ‘mapping’.
The Committees and/or the ERG appraised six cases positively, where the HSUVs were derived either from EQ-5D data collected within a trial, or from different approaches considered to be reasonable (HST1, TA398, TA606, TA443, TA491, TA379). In 10 cases (HST2, HST3, HST5, HST6, HST8, HST9, HST11, HST12, TA431, TA588), the Committees highlighted several limitations in the approach used by the manufacturer (i.e., surveys of clinical experts instead of patients, small samples, EQ-5D value sets from other countries, data collected within observational studies instead of trials, vignettes with unclear disease elements) but recognized that these could be acceptable with some adjustments and/or cautious consideration of results. In the remaining six reports (HST4, HST10, TA266, TA276, HST7, TA467), the Committees were more sceptical about the approach adopted, considering that it might yield unrealistic HSUVs estimates. Consequently, they often preferred to rely on alternative de novo approaches developed by the ERG.
Literature search results
In total, the search strategy in PubMed identified 7445 articles. After removing 190 duplicates, 7255 records were scanned for title/abstract and 7105 were excluded in this first phase. Subsequently, 150 full-text articles were retrieved, and a further 68 records were excluded for not complying with the inclusion criteria. The two main reasons for exclusion were that the study evaluated QoL but did not estimate HSUVs, or that the study used HSUVs from the literature. Accordingly, 82 studies were selected, plus 29 studies identified through manual searches, resulting in a total of 111 studies included in the review [46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,66,67,68,69,70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88,89,90,91,92,93,94,95,96,97,98,99,100,101,102,103,104,105,106,107,108,109,110,111,112,113,114,115,116,117,118,119,120,121,122,123,124,125,126,127,128,129,130,131,132,133,134,135,136,137,138,139,140,141,142,143,144,145,146,147,148,149,150,151,152,153,154,155,156]. No additional articles were obtained from the ScHARRHUD database (Fig. 2).
Synthesis of published studies
The 111 studies estimated HSUVs in the same RDs addressed by the NICE reports (Tables 2, 3). One study [152] was reported twice since it addressed two different conditions, thus leading to the consideration of 112 different studies. At least one study was identified for each RD addressed in the NICE reports, with the exceptions of paediatric-onset hypophosphatasia and neuronal ceroid lipofuscinosis. The highest number of publications were retrieved for cystic fibrosis (n = 36, 32.1%), idiopathic pulmonary fibrosis (n = 16, 14.3%), and Duchenne muscular dystrophy (DMD, n = 10, 8.9%).
In terms of study location, 27 (24.1%) were international studies involving two or more countries. Among the single-country studies, 23 studies (20.5%) were conducted in the UK, 22 (19.6%) in the US, eight (7.1%) in the Netherlands, six (5.4%) in Sweden, five (4.5%) in Canada, three (2.7%) in Australia, Finland and Germany respectively, two (1.8%) in France, Spain and Japan respectively, and one (0.9%) in Switzerland, Ireland, Bulgaria, Portugal respectively; in two cases (1.8%), the study location was not specified. The great majority of studies (n = 78, 70%) were published after 2010 (Fig. 3). Over half of the studies (n = 64, 57.1%) were aiming to evaluate QoL using preference-based questionnaires that (involuntarily) also allowed to derive HSUVs; of these studies, 22 (19.6%) were clinical studies using QoL as secondary endpoint, 30 (26.8%) were primarily aiming to estimate QoL, and 12 (10.7%) were assessing QoL and economic burden together. Only 15 (13.4%) explicitly aimed to estimate HSUVs. The remaining aimed to validate existing questionnaires in a specific patient population (n = 16, 14.3%), to calculate the cost-effectiveness of treatments (n = 14, 12.5%) or to quantify caregiver burden (n = 3, 2.7%).
The majority of studies (n = 72, 64.3%), all published from 2000 onward, used one of the EuroQol instrument versions (i.e., EQ-5D-3L, EQ-5D-5L or EQ-5D-Y) to estimate HSUVs. The QWB scale and HUI 2/3 were chosen as preference-based instruments by 11 (9.8%) and 14 (12.5%) studies respectively, although the former was never used after 2005, while the latter was more equally distributed over time. Cumulatively, 97 studies (86.6%) used at least one preference-based instrument. Direct utility estimation methods were less frequently adopted: 11 (9.8%) studies used SG, ten (8.9%) used visual analogue scale, seven (6.3%) used TTO, and one (0.9%) the risk–risk trade-off, for a total of 14 studies (12.5%) using at least one of these approaches. In three studies (2.7%), caregiver utility was estimated through the Carer-QoL-7D/-VAS. In three studies (2.7%) HSUVs were mapped from non-preference-based measures, while in other three (2.7%) the authors developed an original approach to obtain HSUVs, such as adjusting data from the literature.
The great majority of studies (n = 96, 85.7%) surveyed patients to obtain HSUVs; when the patient was a child, the parent/caregiver support or proxy-reporting was used by 11 (9.8%) studies. Caregivers also reported on their own health in 12 studies (10.7%) which aimed at quantifying the caregiver’s burden, exclusively or in addition to patient’s burden. Finally, very few studies recruited other types of responders such as clinical experts (n = 6, 5.4%) or the general public (n = 5, 4.5%) to derive HSUVs using proxy reporting (e.g., paediatricians valuing their patient’s health) or ‘vignettes’. In particular, the use of health state descriptions or case studies/histories (‘vignettes’) was reported by 9 studies only (8.0%). The sample size was classified as small (< 100 participants) in 42% of the studies (n = 47).
Synthesis of authors’ methodological considerations
Overall, 57 (out of 111) studies retrieved from the literature discussed the pros/cons of the techniques adopted to estimate HSUVs. Their authors’ methodological considerations are reported verbatim in Table 2. Of these, 22 (38.6%) were studies assessing QoL through preference-based PROMs (with or without economic burden), 14 (24.6%) estimating HSUVs as specific objective, and 13 (22.8%) assessing the validity or feasibility of questionnaires. This section presents a synthesis of these authors’ statements for each rare disease covered by the literature review, with the exception of three (i.e., atypical haemolytic uremic syndrome, severe combined immunodeficiency, and Waldenstrom’s macroglobulinaemia), for which no studies commented on the methods to estimate HSUVs.
Cystic fibrosis
Of the 22 studies providing methodological considerations, half (n = 11) commented on the use of the EQ-5D questionnaires (i.e., EQ-5D-3L/-5L and EQ-5D-Y) to estimate HSUVs. The overall evaluation was positive in seven studies [49, 50, 55, 59, 66, 67, 75], where EQ-5D was described as a valid, easy, and cheap generic instrument reflecting disease progression and severity, as well as allowing cross-disease comparisons and country-specific cost-effectiveness analyses. Conversely, four studies [48, 57, 81, 139] revealed its high ceiling effects and poor sensitivity in capturing symptoms (particularly, the self-care domain was considered irrelevant), and encouraged the use of alternative generic instruments (e.g., HUI3 and SF-6D), or, preferably, the development of a disease-specific preference-based measure (also including a respiratory domain).
The remaining 11 studies advised on alternative techniques to derive HSUVs. The judgment on QWB was mixed, with some studies [69, 74, 116] highlighting its low correlation with other measures of physical and emotional functioning, poor sensitivity in detecting changes in functional domains, or the limitations of its scoring system, while others [116, 121, 122] arguing that it showed responsiveness and validity in tracking changes in patients’ health, even during pulmonary exacerbations, and significantly correlated with measures of performance and pulmonary function. Similarly, the HUI was found to include relevant dimensions for cystic fibrosis patients, such as hearing and fertility [91, 153], but failing to discriminate between severity levels, and too difficult for paediatric patients to complete, thus requiring parental proxy-reporting [125, 153]. The only study [77] using CarerQol-7D showed that this instrument is easy and presents some external validity. Finally, two studies adopting direct methods to estimate HSUVs reached opposite conclusions, as one [62] argued that SG tends to produce higher values than other techniques, whereas the other [148] argued that, assuming individuals are neutral to risk, it is likely to underestimate utilities in patients that are keener to accept risk, such as those waiting for lung transplantation. In one study [116], the small number of patients (n = 20) limited to investigate the responsiveness, validity and reproducibility of the instruments adopted.
Idiopathic pulmonary fibrosis
The judgment on EQ-5D was overall positive, with several studies [58, 85, 142, 143] reporting that the instrument was sensitive to short-term health status changes and comparable to disease-specific questionnaires (e.g., K-BILD), and also allowed comparison with different lung diseases and the general population. However, EQ-5D also presented a significant ceiling effect in mild disease and may not fully capture disease-specific issues [58, 143]. Similarly, the QWB was considered not sensitive enough in distinguishing between heterogeneous disease’s manifestations and not correlated with physiological measures (e.g., dyspnoea scale) [65, 145].
Duchenne muscular dystrophy
The authors’ considerations on preference-based PROMs were generally negative since both EQ-5D and HUI lacked sufficient sensitivity to be used in mobility impaired patients [60]. In particular, the EQ-5D has no consideration for mobility beyond walking, thus underestimating the utility reduction for DMD children [60] and was considered insensitive in this condition [99]. The HUI did not correlate with caregiver’s rating of the patient’s health [100], and given its generic nature, was not considered to be applicable to patients with heterogeneous DMD presentations [97]. Conversely, one study [101] reported that the Duchenne Muscular Dystrophy Functional Ability Self-Assessment (DMDSAT) mapped well onto HUI in this patient population.
Other diseases
In Fabry disease, three studies highlighted the limitations of the use of EQ-5D-3L, such as its inability to detect small health changes, by having only three levels per domain [51], and to capture real health state changes, rather than patient’s perceptions of their health state [88]. Moreover, because of disease’s rarity, it was necessary to pool questionnaires in different languages to obtain a sufficiently large sample [128].
Conversely, two studies performed in hereditary transthyretin amyloidosis argued that EQ-5D showed validity and appropriateness, in spite of difficulties due to disease rarity and small samples [89, 119].
The three studies addressing inherited retinal dystrophies (IRDs) used very different approaches. One study [61] argued that TTO is easier to understand by patients compared to SG, and that the latter also overestimates risk aversion, since for each visual acuity stage a lower percentage of patients was willing to risk death (SG) than trade time of life (TTO) to return to perfect vision. The second study [70] reported that EQ-5D showed significant ceiling effects and may not be suitable for patients with genetic eye diseases, despite presenting the advantage of allowing comparison among alternative conditions. The third [107] reported that, due to the ultra-rarity of the disease, the study could not recruit patients and instead developed vignettes to describe IRDs health states, which were subsequently evaluated by six retina specialists using EQ-5D-5L. The authors’ judgment on this approach was overall positive, since descriptions of IRDs may not be easily understood by the public and the qualitative research that informed vignettes to be valued by clinical experts supported the face validity of the HSUVs obtained.
Among the studies performed in mucopolysaccharidosis type Iva (also known as Morquio A), one [86] reported that EQ-5D-5L was not sensitive enough in assessing pain severity if compared with the Brief Pain Inventory, while another [126] admitted having used it, because it was recommended by international guidelines. A further study [87] showed that medical experts (n = 6) and students (n = 93), used as surrogate respondents for patients who are often children with intellectual disability, reported comparable results using direct techniques. The authors also argued that indirect methods (e.g., EQ-5D) could be more practical with larger samples, although their validity in mucopolysaccharidoses was not demonstrated yet.
The only study [80] addressing X-linked hypophosphatemia reported that EQ-5D-5L may not capture disease-specific problems and that preferences in patients with rare bone diseases may not be aligned with those of the general population, which informed the instrument’s valuation system.
In spinal muscular atrophy (SMA), one study [56] discussed the pros and cons of using HUI3, which measures the ability to carry out daily activities (e.g., dressing) that are relevant for physically impaired SMA patients, but also dimensions such as seeing and hearing that are not impacted by the disease. A further study [106] highlighted the limitations of using vignettes to develop SMA case histories and asking clinical experts to value them using EQ-5D-Y. Indeed, standard descriptions of SMA states do not capture the heterogeneous manifestations of the disease, and the accuracy of HSUVs depends on the validity or accuracy of such descriptions and their proxy evaluation. However, the EQ-5D-Y has not been validated in children as young as patients with infantile-onset SMA and using physicians for proxy reporting allow to avoid some bias that parental judgment may introduce. Finally, a third study [156] highlighted the limitations of applying an existing mapping algorithm to SMA patients.
The only study dealing with severe refractory eosinophilic asthma [108] used both the generic EQ-5D and the disease-specific Asthma Symptom Utility Index (ASUI), commenting that the latter may show a floor effect for patients with severe asthma.
In Gaucher disease (GD) of type 1, the four studies providing methodological considerations used direct techniques to estimate HSUVs. The first [68] revealed that, among the techniques used for valuing vignettes by healthy volunteers, chronically ill patients and GD patients, the risk–risk trade-off yielded significantly lower utilities than SG and TTO and showed the poorest test–retest reliability. The second study [79] showed that the WTP approach (used to determine the maximum amount healthy volunteers were willing to pay to have an insurance scheme covering the treatment cost) produced results aligned with SG (used to determine the maximum risk of death they were willing to bear to avoid living with GD), and may be well-suited to elicit preferences for chronic conditions where it is unrealistic to trade-off life duration (as in TTO) or risk of immediate death (as in SG). The third study [84] pointed out the limitations of using health state descriptions that were developed without direct inputs from GD patients and valued by the general population. The fourth study [104] reported that using different search procedures (i.e., the method—titration and “ping-pong”—used to find the point of indifference) significantly impacted the HSUVs estimated by healthy volunteers.
The only study providing methodological considerations in primary biliary cholangitis suggested that HUI performed well since was able to discriminate between patients with mild and severe disease [154].
The only study [138] addressing limbal stem cell deficiency (LSCD) pointed out the limitations of using vignettes to be valued by members of the general public, since some participants failed to correctly interpret LSCD descriptions, and there was limited overlap between the EQ-5D-3L dimensions and LSCD vignettes, despite the inclusion of the vision bolt-on. However, given the rarity of the condition and the small sample achievable, deriving utilities directly from patients would have provided unreliable estimates.
In hereditary angioedema, a study mapping the HAE-BOIS survey items onto EQ-5D-3L reported that the items selected for manual cross-walking were conceptually overlapping with EQ-5D dimensions [53]. The two studies by Nordenfelt et al. [117, 118] reported that EQ-5D-5L is suitable to assess QoL in this condition, and also includes the pain element, as opposed to disease-specific instruments, such as AE-QoL.
Comparison between NICE reports and published studies
Table 4 summarizes the approaches used to derive HSUVs included in the manufacturer NICE submissions (as reported in TA/HST guidance documents, n = 22) with the existing published studies (n = 111) that estimated HSUVs for each individual rare disease considered in this review, irrespective of the publication date and the consequent availability (or not) to manufacturers for developing their economic models.
The use of ‘vignettes’ valued by non-patient populations (i.e., clinical experts or members of the public) was more frequent in NICE technology appraisals than in the literature (5/22, 22.7% vs. 9/112, 8.0%), as well as the use of ‘mapping’ from non-preference-based measures (1/22, 9.1% vs. 3/112, 2.7%). In 36.4% (8/22) of cases, the manufacturers used preference-based PROMs (i.e., EQ-5D and/or HUI) collected from patients (and/or caregivers), while this methodology (alone or in combinations with others) was adopted by 83.0% (93/112) of published studies. Moreover, the literature presented a much wider range of preference-based PROMs including also the generic QWB, SF-6D, and 15D, the CarerQol-7D (specifically for caregivers), and the disease-specific ASUI. The use of direct methods (e.g., SG, TTO, VAS, RRTO) was reported by one manufacturer submission only (1/22, 4.5%) versus 14 published studies (12.5%). One NICE document (4.5%) reported on a DCE performed with the general public, while this technique was not found in the literature reviewed. Finally, in ten NICE appraisals (45.5%) the manufacturers obtained utilities from the literature.
In relation to individual RDs, a high level of agreement in the methodological choices was observed in atypical haemolytic syndrome, where NICE technology appraisal (HST1) and the three studies retrieved from the literature [76, 103, 105] collected EQ-5D directly from patients, as well as in severe refractory eosinophilic asthma, where original utilities were estimated using EQ-5D in TA431, as in the study by Lloyd [108] with the addition of ASUI. Similarly, in Waldenstrom’s macroglobulinemia, both the NICE report (TA491) and published study [73] collected EQ-5D-5L from patients, as well as in idiopathic pulmonary fibrosis, where TA379 and 14 (out of 17) studies used EQ-5D (either 3L or 5L). In cystic fibrosis, the use of preference-based instruments (i.e., EQ-5D and HUI2) and mapping from CFQ was common between technology appraisals (TA266, TA276, TA398) and most of the studies retrieved (22 out of 36).
Conversely, a low agreement between the approaches used in the NICE report and the literature was found in relation to Fabry disease, where the former (HST4) performed a DCE with members of the general population, while the latter collected EQ-5D-3L in eight cases [51, 54, 88, 114, 115, 128, 131, 144] (and, in one study [152], also SF-36 to be valued using SF-6D). In addition, in X-linked hypophosphatemia, the technology appraisal (HST8) estimated HSUVs using vignettes valued by clinical experts through EQ-5D-5L, while in the only study retrieved [80] the same instrument was used to collect data directly from patients. In spinal muscular atrophy, the technology appraisal (TA588) relied on expert opinion and the literature to derive HSUVs, while a variety of approaches (including preference-based instruments collected from patients, caregivers, or clinical experts using vignettes) were presented by the five studies retrieved [56, 64, 106, 110, 156].
In Gaucher disease, the manufacturer’s approach (HST5) consisted in mapping SF-36 onto EQ-5D using a published algorithm; conversely, three published studies [71, 147, 152] collected preference-based measures from patients, while another four [68, 79, 84, 104] used vignettes to be valued using various direct techniques. In DMD, the manufacturer (HST3) derived utilities from the literature (because the algorithm available to convert PedsQL data collected in the clinical study onto EQ-5D utilities was developed in a healthy population), while several preference-based measures (i.e., EQ-5D, EQ-5D-Y, HUI2/3, CarerQol-7D) were used in eight (out of 10) published studies [60, 63, 97,98,99, 101, 102, 123].
Degree of exploitation of the existing literature in manufacturers’ submissions
In Table 5, the methods chosen by manufacturers to obtain HSUVs are compared with: (1) the alternative techniques recommended by NICE ERG and/or Committee (excluding suggestions of minor adjustments that do not alter the main technique, as reported in Table 1) and (2) the techniques adopted in studies that were published at least one year before the TA/HST date (and, therefore, presumably available to manufacturers at the time of drafting their submission), together with a synthesis of study’s authors methodological considerations.
Overall, 72 studies (out of 112, 64.3%) were assumed to be available to manufacturers at the time of their drug submissions. In detail, five studies [75, 102, 117, 150, 154] included in our literature review were explicitly mentioned by the manufacturers as a source to obtain utilities that were not collected in their clinical studies (HST3, TA398, TA443, TA606), or to derive published coefficients [75] to perform a mapping exercise (TA276). In three cases [75, 117, 154], the studies provided positive recommendations regarding the instruments used (i.e., EQ-5D-Y, EQ-5D-5L and HUI2), while in the remaining two [102, 150] no considerations were given by the authors. The report TA443 referred to another study (full reference not provided), already used in a TA on sofosbuvir for treating chronic hepatitis C. One report (HST9) used EQ-5D utilities estimated in a previous study (full reference not provided) on the same condition (i.e., hereditary transthyretin amyloidosis), but with a different country value set. One report (TA491) addressing Waldenstrom’s macroglobulinaemia referred to two published studies [157, 158] on chronic lymphocytic leukaemia (which were not captured by our review, since we excluded cancer). In four cases (TA266, HST2, HST7, TA588), no details were given on the literature source used.
Overall, the association between study authors’ methodological considerations and manufacturers’ choices is not straightforward. In some cases, such as TA398, the use of EQ-5D is corroborated by several studies [49, 50, 59, 67] arguing the instrument is a valid and responsive instrument in cystic fibrosis. Similarly, in idiopathic pulmonary fibrosis, five (out of seven) available studies at the time of NICE assessment report (TA379) used EQ-5D and one [85] also reassured on its sensitivity in this condition. In other cases, such as HST2 and HST8, the manufacturers adopted EQ-5D despite the negative judgment reported by the literature, which showed that it did not correlate with pain measures in Mucopolysaccharidosis type IVa [86] or considered it as unable to capture specific issues in X-linked hypophosphatemia [80]. In IRDs, despite EQ-5D was shown to present significant ceiling effects and considered unsuitable for patients with eye conditions [70], the manufacturer (HST11) used both EQ-5D and HUI3, while the ERG suggested to refer to a previous study deriving TTO values from the general public. In these cases, the choice of using EQ-5D is more likely motivated by the willingness to comply with NICE’s recommendations than with the literature’s considerations. The first report on cystic fibrosis (TA266), in using the HUI3, might have followed some positive judgments on this instrument as reported by studies available at that time [50, 91, 125, 153], which highlighted its responsiveness, ease of understanding, and inclusion of relevant items, such as hearing and fertility, while the evidence on the use of EQ-5D was still scarce. Finally, there are few cases where the manufacturers’ methodological choices seem completely unrelated to the literature. For example, in HST4, a DCE with the public was presented, in spite of the availability of multiple studies using EQ-5D and suggestion from ERG to use values from the literature (unspecified).
Discussion
Synthesis of results
This study aimed to identify, examine and compare the approaches used in NICE technology appraisals and published literature on selected RDs in order to understand the level of agreement between the two and whether the manufacturers fully exploited the existing literature in developing their HTA submissions. A total of 22 TA or HST appraisals encompassing 19 different conditions were downloaded from NICE website. In a substantial number of submissions (10/22, 45.5%), the manufacturers used HSUVs from the literature, as the only approach or combined with primary utility data collected in clinical trials, to inform specific health states (e.g., lung transplantation in reports on cystic fibrosis, TA266 and TA398). However, in more than one third of submissions (8/22), preference-based instruments (EQ-5D or HUI2) were collected from patients during clinical studies. The comments of NICE committees on the approach adopted by manufacturers to obtain utilities for their models were heterogeneous; as expected, more appreciation was expressed when the manufacturers collected EQ-5D directly from patients during a clinical trial (which is the NICE’s recommended approach [9]). In few cases the ERG suggested a completely different approach to estimate HSUVs (e.g., in TA276, using literature values instead of mapping) while more often it recommended only minor adjustments to the manufacturer’s choices.
A total of 111 published studies were identified from the literature, of which one was counted twice because of addressed two different conditions in two separate article sections; therefore, data analysis was performed on 112 studies. One third were related to cystic fibrosis that, with a current prevalence of 11.1 per 100,000 [21], is one of the most common RDs. Almost one fourth were conducted in multiple countries, which is frequent in RDs due to high patients’ geographical dispersion [12]. More than two thirds were published in the last decade (2011–2020), which demonstrates a growing attention for QoL in RDs (and in general), and its incorporation into HTA appraisals [16, 19].
A total of 97 studies (86.6%) used at least one preference-based instrument to estimate HSUVs. Of the six main generic PROMs reported in the literature [8], all were used except for the Australian AQoL. Of them, the most frequently adopted (in 64.3% of studies, 72/112) was by far a EuroQol questionnaire (EQ-5D-3L, EQ-5D-5L or EQ-5D-Y), followed by HUI2/3 (14/112, 12.5%). The three-level version of EQ-5D (i.e., EQ-5D-3L) was first introduced by EuroQol in 1990 (www.euroqol.org), but never used in this review before 2000. The HUI, introduced in 2003, has been rarely but constantly adopted in the included studies since the beginning of the new millennium. The QWB, developed in the 1970s, was adopted by 11 studies (9.8%), all published before 2005, and subsequently likely replaced by more recent instruments. The 15D was used only by three Finnish studies [46, 47, 83], and the SF-6D by one study only [152] (but addressing two different RDs) and together with EQ-5D. The use of direct techniques was much less frequent; indeed, only 14 studies (12.5%) relied on these methodologies, of which 11 (9.8%) used SG. Finally, only three studies (2.7%) [48, 53, 101] performed a mapping exercise to obtain HSUVs from non-preference-based measures. This is not surprisingly, given the limitations related to the use of mapping in RDs emerged from our previous review [13].
In 11 studies (9.8%), parents (or other caregivers) were involved in proxy-reporting on patients’ status in addition to or in place of their children (especially if aged below 12). This review confirmed a growing consideration of caregivers’ and, in general, families’ well-being in research [16], since 12 studies (10.7%) interviewed the main carers about their own status, and also in HTA [15], as observed in three NICE appraisals (TA588, HST3, HST12) and related positive comments from ERG. The participants enrolled were less than 100 in 42% of studies, which is frequent in RDs where small-scale studies are usually conducted [12, 16, 19].
The degree of overlap between the methods adopted by manufacturers for their submission to NICE and published studies was quite limited and varied a lot according to the disease considered. In particular, the use of ‘vignettes’ valued by clinical experts and reliance on expert opinion to estimate HSUVs was much more frequent in manufacturers’ submissions than in the published literature (5/22, 22.7% vs. 9/112, 8.0%), where the great majority of studies (85.7%) surveyed patients directly to value their own status. It is worth noting that four (out of five) NICE technology appraisals reporting the use of vignettes were HST targeting ultra-RDs, in which it may be difficult to recruit patients. The reasons for diverging from the existing literature might be different. In some cases (e.g., HST6 and HST12), we realized that there were no published studies to refer to, while in others (e.g., HST1 and HST8) very few studies were available at the time of submission to NICE. Again, this situation was much more common for HST targeting ultra-rare diseases. In the submission related to IRDs (HST11), the company expressed a preference for HUI3 instead of EQ-5D, recommended by NICE and used by a previous study [70]), because the former contains a vision component while the latter is known to have poor validity in eye disorders [30]. In other reports, the patients’ characteristics, in terms of age or disease’s subtype, might be different from the samples enrolled in published studies, thus inducing manufacturers to perform a new data collection or embrace different approaches. For example, the eculizumab submission (HST1 [26]) also included paediatric patients (0–11 years), while the study retrieved in atypical haemolytic–uremic syndrome recruited only patients above 12 [103]. Similarly, burosumab (HST8 [24]) was authorized for children and adolescents, while the corresponding study on X-linked hypophosphatemia was focused on adults [80]; this might have incentivized the manufacturer to conduct an ad hoc vignette study where clinicians, instead of patients, completed the EQ-5D. However, it is difficult to identify a clear pattern in manufacturers’ choices, but in several cases, they seem unrelated to the literature. The choice of using EQ-5D, indeed, is more likely motivated by the willing to comply with NICE methods guidance [9]. In the appraisal reports reviewed, manufacturers rarely referred to (or commented on) the available literature on HSUVs. In few cases, the company admitted using literature data, because QoL or utility data were not (or not sufficiently) collected in clinical trials (HST7, TA606, TA443), or there were no suitable mapping algorithms to convert PROMs used in the clinical studies (e.g., Norfolk QOL-DN, PedsQL) to the EQ-5D or other utilities (HST3, HST9, HST11). In other cases (HST8, HST11), the manufacturers reported the lack of literature data (and/or mapping algorithms) and, therefore, the necessity to commission an ad hoc vignette study. Future research should adopt qualitative methods (e.g., focus groups, interviews) to investigate the deep reasons why manufacturers exploited (or not) the literature to inform their new drug submissions.
Limitations
Our research has several limitations. First, in order to limit the number of studies to be included in the literature, we deliberately excluded those using secondary sources to derive HSUVs. Thus, we could not compare NICE technology appraisals and published studies in terms of the frequency in use of such an approach. Second, we only searched two databases (PubMed and ScHARRHUD), while using other sources (e.g., Embase) and checking the reference lists of the included studies could have provided additional results. Third, except for the pilot phase, the data extraction was done independently by a single author, while should be conducted by at least two people [159]. Fourth, we did not perform any quality assessment of the included studies; however, this task is considered optional by PRISMA-ScR (item 12) [22]. Fifth, although we considered studies published at least one year before the NICE report date to understand whether manufacturers fully exploited the literature, it is possible that they develop their economic analyses more than one year ahead of the submission to NICE, in which case they would not be able to consult and reference some of the studies used for comparing the approaches. Therefore, the comparison between NICE reports and the literature should be interpreted with caution and not be viewed as direct criticism of manufacturers, but rather as an encouragement for using different approaches or values existing in the literature in future submissions, given the availability of estimates and the methodological considerations discussed by authors.
Key learnings
This review provided several insights into research and HTA. The critical issues highlighted in previous studies undertaken within the IMPACT-HTA project [12, 13, 19] and related to the general use of PROMs and the estimation of HSUVs in RDs were confirmed by the manuscripts reviewed, as well as their authors’ considerations. First, due to the low prevalence of RDs, the samples recruited in studies were generally small and might have affected the validity of results. In several cases, the ultra-rarity of condition (e.g., IRDs) hampered the enrolment of a representative patient sample and required the use of ‘vignettes’. The wide use of non-patient populations is a key issue in the estimation of HSUVs in RDs, since most patients are children or cognitively impaired, and cannot perform complex tasks, such as those required by direct techniques. Second, even in studies using simpler indirect approaches (e.g., EQ-5D), patients were often children who necessarily required parental support in order to fill in the required questionnaire to obtain HSUVs. The use of paediatric versions of instruments (e.g., EQ-5D-Y) can overcome the issue only partially, since they do not apply to mentally disabled or very young children.
Third, despite their advantages (i.e., simplicity, cheapness, comparability across different conditions), preference-based PROMs showed poor sensitivity in several RDs, due to their heterogeneity and peculiarities. For example, in conditions implying severe physical impairment (e.g., DMD), the EQ-5D does not consider mobility issues beyond walking, or do not include visual domains for eye conditions (e.g., limbal stem cell deficiency), unless adding the vision bolt-on. Moreover, EQ-5D also presented ceiling effects in mild disease. Fourth, patients’ geographical dispersion is likely to have required multi-country research studies and multi-lingual instruments, which inevitably imply more logistic and financial resources.
In this review, a variety of approaches were identified to estimate HSUVs. EQ-5D, which is currently the most frequently cited instrument (85%) in national pharmacoeconomic guidelines [160], was also the preferred instrument in the studies reviewed, with over 64% using it alone or in combination with other approaches. Similarly, 15 out of 22 (68.2%) manufacturers’ submissions presented EQ-5D utilities derived in different ways (i.e., from patients, from clinical experts using ‘vignettes’, by ‘mapping’ non-preference-based measures, or using values from the literature), in agreement with NICE guidelines indicating that EQ-5D can be sourced from the literature or by mapping, if not available in the clinical trials [9]. On the basis of methodological considerations extracted from published studies, EQ-5D is found to be an easy, valid and sensitive instrument in chronic lung diseases (i.e., cystic fibrosis and idiopathic pulmonary fibrosis), and also allows comparison with other conditions. However, it also includes irrelevant aspects (i.e., self-care) and present significant ceiling effects, especially in less severe health states. The instrument was presented as valid and appropriate also in hereditary transthyretin amyloidosis and hereditary angioedema. In DMD, instead, EQ-5D is valued as almost insensitive in capturing relevant disease’s changes, especially because does not consider any mobility aspects besides walking. The main EQ-5D limitations (i.e., poor sensitivity, ceiling effects) were also highlighted in other rare conditions (i.e., Fabry disease, IRD, mucopolysaccharidosis type Iva, X-linked hypophosphatemia). The NICE, on the other hand, acknowledged that the EQ-5D may not be the most appropriate instrument in some cases, and alternative QoL measures may be used provided that the lack of content validity for the EQ-5D is proven [9]. For example, in HST10, NICE’s committee considered that EQ-5D might not fully capture all the effects of autonomic neuropathy [29].
The comments about the other two frequently used preference-based PROMs (i.e., HUI2/3 and QWB) identified in the literature review (which were used by 12.5% and 9.8% of published studies, respectively) were heterogeneous. Similar to EQ-5D, HUI2/3 was found to perform well in rare lung diseases and include relevant dimensions (e.g., hearing, fertility) for cystic fibrosis patients, but not relevant enough for physically impaired patients, such as those affected by DMD and SMA. QWB was used by eleven studies all performed in chronic lung diseases (i.e., cystic fibrosis and idiopathic pulmonary fibrosis), where the evaluation of its performance and validity was mixed. In relation to direct techniques, it is worth mentioning that two studies [61, 148] stated that SG tends to over or underestimate HSUVs, depending on respondent’s willingness to accept the risk. Finally, the use of ‘vignettes’ valued by non-patient populations was often perceived as a ‘second-best’ approach justified by low prevalence of disease and consequent difficulties in recruiting patients [107]. Indeed, some studies [84, 138] highlighted the limitations of such an approach, mainly regarding standard descriptions of health states that do not capture the heterogeneous manifestations of diseases, development of vignettes in the absence of patient’s inputs, limited overlap between health state descriptions and the instrument (e.g., EQ-5D) used to value them, and difficulties for the general public to identify themselves in hypothetical RD patient descriptions.
Conclusions
This is the first paper that analyses and compares the methods used to estimate HSUVs in NICE TA and HST appraisal of drugs with an EMA orphan designation with the published literature for the same targeted RDs. The study confirmed that EQ-5D is the preferred preference-based instrument, as recommended by NICE and other HTA authorities. The agreement on methodological choices between the two types of documents (i.e., NICE appraisals and published manuscripts) was only partial, since the former relied more often on ‘vignettes’ and clinical experts. Despite almost half of the appraisal reports reviewed referred to the literature (sometimes even without specifying the study referred to), more efforts should be made by manufacturers to exploit comprehensively the methods, results and considerations found in the published studies, which occasionally advises against using EQ-5D in selected RDs. At the same time, since methodological considerations retrieved from the existing literature are often inconsistent, future research could further investigate the most appropriate methods to estimate HSUVs for individual RDs and identify the ways to overcome the limitations of existing instruments. However, the wide use of EQ-5D in published research studies should provide some reassurance concerning the validity of this instrument at least in some RDs, such as those affecting respiratory system (i.e., cystic fibrosis and idiopathic pulmonary fibrosis). The use of condition-specific preference-based measures, which was very limited in our review, is also encouraged at least to determine their validity and responsiveness in comparison with the generic ones. Moreover, a deeper understanding of proxy-reporting for paediatric or cognitively impaired patients is required, to assess the validity of this frequently used approach in RDs, as well as the impact of including their carers’ QoL on HTA results.
References
Bogart, K.R., Irvin, V.L.: Health-related quality of life among adults with diverse rare disorders. Orphanet J. Rare Dis. 12(1), 177 (2017)
U.S. Food and Drug Administration (FDA): Patient-Reported Outcome Measures: Use in Medical Product Development to Support Labeling Claims (2009). https://www.fda.gov/regulatory-information/search-fda-guidance-documents/patient-reported-outcome-measures-use-medical-product-development-support-labeling-claims. Accessed 16 Mar 2021
Weldring, T., Smith, S.M.: Patient-reported outcomes (PROs) and patient-reported outcome measures (PROMs). Health Serv. Insights 6, 61–68 (2013)
Bansback, N., Harrison, M., Brazier, J., Davies, L., Kopec, J., Marra, C., et al.: Health state utility values: a description of their development and application for rheumatic diseases. Arthritis Rheum. 59(7), 1018–1026 (2008)
Lamers, L.M.: The transformation of utilities for health states worse than death: consequences for the estimation of EQ-5D value sets. Med. Care 45(3), 238–244 (2007)
Arnold, D., Girling, A., Stevens, A., Lilford, R.: Comparison of direct and indirect methods of estimating health state utilities for resource allocation: review and empirical analysis. Br. Med. J. 339, b2688 (2009)
Bahrampour, M., Byrnes, J., Norman, R., Scuffham, P.A., Downes, M.: Discrete choice experiments to generate utility values for multi-attribute utility instruments: a systematic review of methods. Eur. J. Health Econ. 21(7), 983–992 (2020)
Brazier, J., Ara, R., Rowen, D., Chevrou-Severac, H.: A review of generic preference-based measures for use in cost-effectiveness models. Pharmacoeconomics 35(Suppl 1), 21–31 (2017)
National Institute for Health and Care Excellence (NICE): guide to the methods of technology appraisal 2013 (2013). https://www.nice.org.uk/process/pmg9/resources/guide-to-the-methods-of-technology-appraisal-2013-pdf-2007975843781. Accessed 4 Jul 2021
National Institute for Health and Care Excellence (NICE): Position statement on use of the EQ-5D-5L value set for England (updated October 2019). https://www.nice.org.uk/about/what-we-do/our-programmes/nice-guidance/technology-appraisal-guidance/eq-5d-5l. Accessed 19 Apr 2022
Longworth, L., Rowen, D.: Mapping to obtain EQ-5D utility values for use in NICE health technology assessment. Value Health 16(1), 202–210 (2013)
Meregaglia, M., Nicod, E., Drummond, M.: The estimation of health state utility values in rare diseases: overview of existing techniques. Int. J. Technol. Assess. Health Care 36(5), 469–473 (2020)
Meregaglia, M., Whittal, A., Nicod, E., Drummond, M.: “Mapping” health state utility values from non-preference-based measures: a systematic literature review in rare diseases. Pharmacoeconomics 38(6), 557–574 (2020)
Rowen, D., Brazier, J., Ara, R., Azzabi, Z.I.: The role of condition-specific preference-based measures in health technology assessment. Pharmacoeconomics 35(Suppl 1), 33–41 (2017)
McLoughlin, C., Goranitis, I., Al-Janabi, H.: Validity and responsiveness of preference-based quality-of-life measures in informal carers: a comparison of 5 measures across 4 conditions. Value Health 23(6), 782–790 (2020)
Slade, A., Isa, F., Kyte, D., Pankhurst, T., Kerecuk, L., Ferguson, J., et al.: Patient reported outcome measures in rare diseases: a narrative review. Orphanet J. Rare Dis. 13(1), 61 (2018)
Goodrich, K., Kaambwa, B., Al-Janabi, H.: The inclusion of informal care in applied economic evaluation: a review. Value Health 15(6), 975–981 (2012)
Nicod, E., Meregaglia, M., Whittal, A., Upadhyaya, S., Facey, K., Drummond, M.: Consideration of quality of life in the health technology assessments of rare disease treatments. Eur. J. Health Econ. 23(4), 645–669 (2022)
Whittal, A., Meregaglia, M., Nicod, E.: The use of patient reported outcome measures in rare diseases and implications for health technology assessment. Patient 14(5), 485–503 (2021)
Powell T., O'Donnell M.: NICE appraisals of rare diseases (2019). https://researchbriefings.files.parliament.uk/documents/CDP-2019-0022/CDP-2019-0022.pdf. Accessed 5 Nov 2021
Orphanet: Prevalence and incidence of rare diseases: bibliographic data. Orphanet Report Series. 1 (2022) https://www.orpha.net/orphacom/cahiers/docs/GB/Prevalence_of_rare_diseases_by_alphabetical_list.pd. Accessed 13 Apr 2022
Tricco, A.C., Lillie, E., Zarin, W., O’Brien, K.K., Colquhoun, H., Levac, D., et al.: PRISMA extension for scoping reviews (PRISMA-ScR): checklist and explanation. Ann. Intern. Med. 169(7), 467–473 (2018)
Munn, Z., Peters, M.D., Stern, C., Tufanaru, C., McArthur, A., Aromataris, E.: Systematic review or scoping review? Guidance for authors when choosing between a systematic or scoping review approach. BMC Med. Res. Methodol. 18(1), 143 (2018)
National Institute for Health and Care Excellence (NICE): Burosumab for treating X-linked hypophosphataemia in children and young people. Highly specialised technologies guidance (HST8). 10 October 2018. https://www.nice.org.uk/guidance/HST8. Accessed 16 Jun 2020
National Institute for Health and Care Excellence (NICE): Asfotase alfa for treating paediatric onset hypophosphatasia. Highly Specialised Technology Guidance (HST6). 2 August 2017. https://www.nice.org.uk/guidance/hst6. Accessed 16 Jun 2020
National Institute for Health and Care Excellence (NICE): Eculizumab for treating atypical haemolytic uraemic syndrome. Highly specialised technologies guidance (HST1). 28 January 2015. https://www.nice.org.uk/guidance/hst1. Accessed 16 Jun 2020
National Institute for Health and Care Excellence (NICE): Migalastat for treating Fabry disease. Highly specialised technologies guidance (HST4). 22 February 2017. https://www.nice.org.uk/guidance/hst4. Accessed 16 Jun 2020
National Institute for Health and Care Excellence (NICE): Inotersen for treating hereditary transthyretin amyloidosis. Highly specialised technology guidance (HST9). 22 May 2019. https://www.nice.org.uk/guidance/HST9. Accessed 16 Jun 2020
National Institute for Health and Care Excellence (NICE): Patisiran for treating hereditary transthyretin amyloidosis. Highly specialised technology guidance (HST10). 14 August 2019. https://www.nice.org.uk/guidance/HST10. Accessed 16 Jun 2020
National Institute for Health and Care Excellence (NICE): Voretigene neparvovec for treating inherited retinal dystrophies caused by RPE65 gene mutations. Highly specialised technologies guidance (HST11). 9 October 2019. https://www.nice.org.uk/guidance/HST11. Accessed 16 Jun 2020
National Institute for Health and Care Excellence (NICE): Mannitol dry powder for inhalation for treating cystic fibrosis. Technology Appraisal Guidance (TA266). 28 November 2012. https://www.nice.org.uk/Guidance/TA266. Accessed 16 Jun 2020
National Institute for Health and Care Excellence (NICE): Lumacaftor–ivacaftor for treating cystic fibrosis homozygous for the F508del mutation. Technology Appraisal Guidance (TA398). 27 July 2016. https://www.nice.org.uk/guidance/ta398. Accessed 16 Jun 2020
National Institute for Health and Care Excellence (NICE): Colistimethate sodium and tobramycin dry powders for inhalation for treating pseudomonas lung infection in cystic fibrosis. Technology appraisal guidance (TA276). 27 March 2013. https://www.nice.org.uk/guidance/ta276. Accessed 16 Jun 2020
National Institute for Health and Care Excellence (NICE): Elosulfase alfa for treating mucopolysaccharidosis type IVa. Highly specialised technologies guidance (HST2). 16 December 2015. https://www.nice.org.uk/guidance/hst2. Accessed 16 Jun 2020
National Institute for Health and Care Excellence (NICE): Strimvelis for treating adenosine deaminase deficiency - severe combined immunodeficiency. Highly specialised technology guidance (HST7). 7 February 2018. https://www.nice.org.uk/guidance/hst7. Accessed 17 Jun 2020
National Institute for Health and Care Excellence (NICE): Nusinersen for treating spinal muscular atrophy. Technology Appraisal Guidance (TA588). 24 July 2019. https://www.nice.org.uk/guidance/TA588. Accessed 17 Jun 2020
National Institute for Health and Care Excellence (NICE): Mepolizumab for treating severe refractory eosinophilic asthma. Technology appraisal guidance (TA431). 25 January 2017. https://www.nice.org.uk/guidance/ta431. Accessed 17 Jun 2020
National Institute for Health and Care Excellence (NICE): Eliglustat for treating type 1 Gaucher disease. Highly specialised technology guidance (HST5). 28 June 2017. https://www.nice.org.uk/guidance/HST5. Accessed 17 Jun 2020
National Institue for Health and Care Excellence (NICE): Holoclar for treating limbal stem cell deficiency after eye burns. Technology appraisal guidance (TA467). 16 August 2017. https://www.nice.org.uk/guidance/ta467. Accessed 17 Jun 2020
National Institute for Health and Care Excellence (NICE): Lanadelumab for preventing recurrent attacks of hereditary angioedema. Technology appraisal guidance (TA606). 16 October 2019. https://www.nice.org.uk/guidance/ta606. Accessed 17 Jun 2020
National Institute for Health and Care Excellence (NICE): Obeticholic acid for treating primary biliary cholangitis. Technology appraisal guidance (TA443). 26 April 2017. https://www.nice.org.uk/guidance/ta443. Accessed 17 Jun 2020
National Institute for Health and Care Excellence (NICE): Ibrutinib for treating Waldenström’s macroglobulinaemia. Technology appraisal guidance (TA491). 22 November 2017. https://www.nice.org.uk/guidance/ta491. Accessed 17 Jun 2020
National Institute for Health and Care Excellence (NICE): Nintedanib for treating idiopathic pulmonary fibrosis. Technology appraisal guidance (TA379). 27 January 2016. https://www.nice.org.uk/guidance/TA379/chapter/1-Recommendations. Accessed 17 Jun 2020
National Institute for Health and Care Excellence (NICE): Ataluren for treating Duchenne muscular dystrophy with a nonsense mutation in the dystrophin gene (HST3). 20 July 2016. https://www.nice.org.uk/guidance/hst3. Accessed 17 Jun 2020
National Institute for Health and Care Excellence (NICE): Cerliponase alfa for treating neuronal ceroid lipofuscinosis type 2. Highly specialised technologies guidance (HST12). 27 November 2019. https://www.nice.org.uk/guidance/HST12. Accessed 17 Jun 2020
Åberg, F., Höckerstedt, K., Roine, R., Sintonen, H., Isoniemi, H.: Influence of liver-disease etiology on long-term quality of life and employment after liver transplantation. Clin. Transplant. 26(5), 729–735 (2012)
Åberg, F., Mäklin, S., Räsänen, P., Roine, R.P., Sintonen, H., Koivusalo, A.M., et al.: Cost of a quality-adjusted life year in liver transplantation: the influence of the indication and the model for end-stage liver disease score. Liver Transpl. 17(11), 1333–1343 (2011)
Acaster, S., Pinder, B., Mukuria, C., Copans, A.: Mapping the EQ-5D index from the cystic fibrosis questionnaire-revised using multiple modelling approaches. Health Qual. Life Outcomes 13, 33 (2015)
Angelis, A., Kanavos, P., López-Bastida, J., Linertová, R., Nicod, E., Serrano-Aguilar, P., et al.: Social and economic costs and health-related quality of life in non-institutionalised patients with cystic fibrosis in the United Kingdom. BMC Health Serv. Res. 15, 428 (2015)
Anyanwu, A.C., McGuire, A., Rogers, C.A., Murday, A.J.: Assessment of quality of life in lung transplantation using a simple generic tool. Thorax 56(3), 218–222 (2001)
Arends, M., Körver, S., Hughes, D.A., Mehta, A., Hollak, C.E.M., Biegstraaten, M.: Phenotype, disease severity and pain are major determinants of quality of life in Fabry disease: results from a large multicenter cohort study. J. Inherit. Metab. Dis. 41(1), 141–149 (2018)
Aygören-Pürsün, E., Soteres, D.F., Nieto-Martinez, S.A., Christensen, J., Jacobson, K.W., Moldovan, D., et al.: A randomized trial of human C1 inhibitor prophylaxis in children with hereditary angioedema. Pediatr. Allergy Immunol. 30(5), 553–561 (2019)
Aygören-Pürsün, E., Bygum, A., Beusterien, K., Hautamaki, E., Sisic, Z., Boysen, H.B., et al.: Estimation of EuroQol 5-Dimensions health status utility values in hereditary angioedema. Patient Prefer. Adherence 10, 1699–1707 (2016)
Beck, M., Ricci, R., Widmer, U., Dehout, F., de Lorenzo, A.G., Kampmann, C., et al.: Fabry disease: overall effects of agalsidase alfa treatment. Eur. J. Clin. Invest. 34(12), 838–844 (2004)
Bell, S.C., Mainz, J.G., MacGregor, G., Madge, S., Macey, J., Fridman, M., et al.: Patient-reported outcomes in patients with cystic fibrosis with a G551D mutation on ivacaftor treatment: results from a cross-sectional study. BMC Pulm Med. 19(1), 146 (2019)
Belter, L., Cruz, R., Jarecki, J.: Quality of life data for individuals affected by spinal muscular atrophy: a baseline dataset from the cure SMA community update survey. Orphanet J. Rare Dis. 15(1), 217 (2020)
Bleisch, B., Schuurmans, M.M., Klaghofer, R., Benden, C., Seiler, A., Jenewein, J.: Health-related quality of life and stress-related post-transplant trajectories of lung transplant recipients: a three-year follow-up of the Swiss transplant cohort study. Swiss Med. Wkly. 149, 07–08 (2019)
Bloem, A.E.M., Mostard, R.L.M., Stoot, N., Vercoulen, J.H., Peters, J.B., Janssen, D.J.A., et al.: Severe fatigue is highly prevalent in patients with IPF or sarcoidosis. J. Clin. Med. 9(4), 1178 (2020)
Bradley, J.M., Blume, S.W., Balp, M.M., Honeybourne, D., Elborn, J.S.: Quality of life and healthcare utilisation in cystic fibrosis: a multicentre study. Eur. Respir. J. 41(3), 571–577 (2013)
Bray, N., Noyes, J., Harris, N., Tudor, E.R.: Measuring the health-related quality of life of children with impaired mobility: examining correlation and agreement between children and parent proxies. BMC Res. Notes 10(1), 377 (2017)
Brown, G.C.: Vision and quality-of-life. Trans. Am. Ophthalmol. Soc. 97, 473–511 (1999)
Busschbach, J.J., Horikx, P.E., van den Bosch, J.M., de la Rivière, A.B., de Charro, F.T.: Measuring the quality of life before and after bilateral lung transplantation in patients with cystic fibrosis. Chest 105(3), 911–917 (1994)
Cavazza, M., Kodra, Y., Armeni, P., De Santis, M., López-Bastida, J., Linertová, R., et al.: Social/economic costs and health-related quality of life in patients with Duchenne muscular distrophy in Europe. Eur. J. Health Econ. 17(Suppl 1), S19–S29 (2016)
Chambers, G.M., Settumba, S.N., Carey, K.A., Cairns, A., Menezes, M.P., Ryan, M., et al.: Prenusinersen economic and health-related quality of life burden of spinal muscular atrophy. Neurology 95(1), e1–e10 (2020)
Chang, J.A., Curtis, J.R., Patrick, D.L., Raghu, G.: Assessment of health-related quality of life in patients with interstitial lung disease. Chest 116(5), 1175–1182 (1999)
Chevreul, K., Michel, M., Berg, B.K., López-Bastida, J., Linertová, R., Oliva-Moreno, J., et al.: Social/economic costs and health-related quality of life in patients with cystic fibrosis in Europe. Eur. J. Health Econ. 17(Suppl 1), S7–S18 (2016)
Chevreul, K., Berg, B.K., Michel, M., Rault, G.: BURQOL-RD research network: costs and health-related quality of life of patients with cystic fibrosis and their carers in France. J. Cyst. Fibros. 14(3), 384–391 (2015)
Clarke, A.E., Goldstein, M.K., Michelson, D., Garber, A.M., Lenert, L.A.: The effect of assessment method and respondent population on utilities elicited for Gaucher disease. Qual. Life Res. 6(2), 169–184 (1997)
Czyzewski, D.I., Mariotto, M.J., Bartholomew, L.K., LeCompte, S.H., Sockrider, M.M.: Measurement of quality of well being in a child and adolescent cystic fibrosis population. Med. Care 32(9), 965–972 (1994)
Davison, N., Payne, K., Eden, M., McAllister, M., Roberts, S.A., Ingram, S., et al.: Exploring the feasibility of delivering standardized genomic care using ophthalmology as an example. Genet. Med. 19(9), 1032–1039 (2017)
Deegan, P.B., Pavlova, E., Tindall, J., Stein, P.E., Bearcroft, P., Mehta, A., et al.: Osseous manifestations of adult gaucher disease in the era of enzyme replacement therapy. Medicine (Baltimore) 90(1), 52–60 (2011)
DeWitt, E.M., Grussemeyer, C.A., Friedman, J.Y., Dinan, M.A., Lin, L., Schulman, K.A., et al.: Resource use, costs, and utility estimates for patients with cystic fibrosis with mild impairment in lung function: analysis of data collected alongside a 48-week multicenter clinical trial. Value Health 15(2), 277–283 (2012)
Dimopoulos, M.A., Trotman, J., Tedeschi, A., Matous, J.V., Macdonald, D., Tam, C., et al.: Ibrutinib for patients with rituximab-refractory Waldenström’s macro globulinaemia (iNNOVATE): an open-label substudy of an international, multicentre, phase 3 trial. Lancet Oncol. 18(2), 241–250 (2017)
Dunlevy, C.L., Herbert, D.F., Hill, E., Baez, S., Clutter, J.: Physiological and psychological effects of low-impact aerobic exercise on young adults with cystic fibrosis. J. Cardiopulm. Rehabil. 14, 47–51 (1994)
Eidt-Koch, D., Mittendorf, T., Greiner, W.: Cross-sectional validity of the EQ-5D-Y as a generic health outcome instrument in children and adolescents with cystic fibrosis in Germany. BMC Pediatr. 9, 55 (2009)
Fakhouri, F., Hourmant, M., Campistol, J.M., Cataland, S.R., Espinosa, M., Osama, G.A., et al.: Terminal complement inhibitor eculizumab in adult patients with atypical hemolytic uremic syndrome: a single-arm, open-label trial. Am. J. Kidney Dis. 68(1), 84–93 (2016)
Fitzgerald, C., George, S., Somerville, R., Linnane, B., Fitzpatrick, P.: Caregiver burden of parents of young children with cystic fibrosis. J Cyst Fibros 17(1), 125–131 (2018)
Fitzgerald, D.A., Hilton, J., Jepson, B., Smith, L.: A crossover, randomized, controlled trial of dornase alfa before versus after physiotherapy in cystic fibrosis. Pediatrics 116(4), e549–e554 (2005)
Flowers, C.R., Garber, A.M., Bergen, M.R., Lenert, L.A.: Willingness-to-pay utility assessment: feasibility of use in normative patient decision support systems. Proc AMIA Annu Fall Symp 223–7 (1997)
Forestier-Zhang, L., Watts, L., Turner, A., Teare, H., Kaye, J., Barrett, J., et al.: Health-related quality of life and a cost-utility simulation of adults in the UK with osteogenesis imperfecta, X-linked hypophosphatemia and fibrous dysplasia. Orphanet J. Rare Dis. 11(1), 160 (2016)
Gold, L.S., Patrick, D.L., Hansen, R.N., Beckett, V., Goss, C.H., Kessler, L.: Correspondence between symptoms and preference-based health status measures in the STOP study. J. Cyst. Fibros. 18(2), 251–264 (2019)
Groen, H., van der Bij, W., Koëter, G.H., TenVergert, E.M.: Cost-effectiveness of lung transplantation in relation to type of end-stage pulmonary disease. Am. J. Transplant. 4(7), 1155–1162 (2004)
Haapamäki, J., Tenca, A., Sintonen, H., Barner-Rasmussen, N., Färkkilä, M.A.: Health-related quality of life among patients with primary sclerosing cholangitis. Liver Int. 35(9), 2194–2201 (2015)
Hadi, M., Swinburn, P., Nalysnyk, L., Hamed, A., Mehta, A.: A health state utility valuation study to assess the impact of treatment mode of administration in Gaucher disease. Orphanet J. Rare Dis. 13(1), 159 (2018)
Han, M.K., Bach, D.S., Hagan, P.G., Yow, E., Flaherty, K.R., Toews, G.B., et al.: Sildenafil preserves exercise capacity in patients with idiopathic pulmonary fibrosis and right-sided ventricular dysfunction. Chest 143(6), 1699–1708 (2013)
Hendriksz, C.J., Lavery, C., Coker, M., Kalkan, U.S., Jain, M., Bell, L.: Burden of disease in patients with Morquio A syndrome: results from an international patient-reported outcomes survey. Orphanet J. Rare Dis. 9, 32 (2014)
Hisashige, A.: Quality of life among patients with mucopolysaccharidoses. Appl. Res. Qual Life. 6, 105–113 (2011)
Hoffmann, B., Garcia de Lorenzo, A., Mehta, A., Beck, M., Widmer, U., Ricci, R., FOS European Investigators: Effects of enzyme replacement therapy on pain and health related quality of life in patients with Fabry disease data from FOS (Fabry outcome survey). J. Med. Genet. 42(3), 247–252 (2005)
Inês, M., Coelho, T., Conceição, I., Ferreira, L., de Carvalho, M., Costa, J.: Health-related quality of life in hereditary transthyretin amyloidosis polyneuropathy: a prospective, observational study. Orphanet J. Rare Dis. 15(1), 67 (2020)
Iskrov, G.G., Stefanov, R.S., López-Bastida, J., Linertová, R., Oliva-Moreno, J., Serrano-Aguilar, P.: BURQOL-RD research network: economic burden and health-related quality of life of patients with cystic fibrosis in Bulgaria. Folia Med. (Plovdiv) 57(1), 56–64 (2015)
Janse, A.J., Sinnema, G., Uiterwaal, C.S.P.M., Kimpen, J.L.L., Gemke, R.J.B.J.: Quality of life in chronic illness: children, parents and paediatricians have different, but stable perceptions. Acta Paediatr. 97(8), 1118–1124 (2008)
Janse, A.J., Uiterwaal, C.S.P.M., Gemke, R.J.B.J., Kimpen, J.L.L., Sinnema, G.: A difference in perception of quality of life in chronically ill children was found between parents and pediatricians. J. Clin. Epidemiol. 58(5), 495–502 (2005)
Johnson, J.A., Connolly, M., Zuberbuhler, P., Brown, N.E.: Health-related quality of life for adults with cystic fibrosis: a regression approach to assessing the impact of recombinant human DNase. Pharmacotherapy 20(10), 1167–1174 (2000)
King, T.E., Jr., Brown, K.K., Raghu, G., du Bois, R.M., Lynch, D.A., Martinez, F., et al.: BUILD-3: a randomized, controlled trial of bosentan in idiopathic pulmonary fibrosis. Am. J. Respir. Crit. Care Med. 184(1), 92–99 (2011)
Kotecha, J., Atkins, C., Wilson, A.: Patient confidence and quality of life in idiopathic pulmonary fibrosis and sarcoidosis. Sarcoidosis Vasc. Diffuse Lung Dis. 33(4), 341–348 (2016)
Kreuter, M., Swigris, J., Pittrow, D., Geier, S., Klotsche, J., Prasse, A., et al.: Health related quality of life in patients with idiopathic pulmonary fibrosis in clinical practice: insights-IPF registry. Respir. Res. 18(1), 139 (2017)
Landfeldt, E., Lindberg, C., Sejersen, T.: Improvements in health status and utility associated with ataluren for the treatment of nonsense mutation Duchenne muscular dystrophy. Muscle Nerve 61(3), 363–368 (2020)
Landfeldt, E., Mayhew, A., Straub, V., Lochmüller, H., Bushby, K., Lindgren, P.: Psychometric analysis of the pediatric quality of life inventory 3.0 neuromuscular module administered to patients with duchenne muscular dystrophy: a rasch analysis. Muscle Nerve 58(3), 367–373 (2018)
Landfeldt, E., Lindgren, P., Bell, C.F., Guglieri, M., Straub, V., Lochmüller, H., et al.: Quantifying the burden of caregiving in Duchenne muscular dystrophy. J. Neurol. 263(5), 906–915 (2016)
Landfeldt, E., Lindgren, P., Bell, C.F., Guaglieri, M., Straub, V., Lochmüller, H., et al.: Health-related quality of life in patients with Duchenne muscular dystrophy: a multinational, cross-sectional study. Dev. Med. Child Neurol. 58(5), 508–515 (2016)
Landfeldt, E., Mayhew, A., Eagle, M., Lindgren, P., Bell, C.F., Guaglieri, M., et al.: Development and psychometric analysis of the Duchenne muscular dystrophy functional ability self-assessment tool (DMDSAT). Neuromuscul. Disord. 25(12), 937–944 (2015)
Landfeldt, E., Lindgren, P., Bell, C.F., Schmitt, C., Guaglieri, M., Straub, V., et al.: The burden of Duchenne muscular dystrophy. An international, cross-sectional study. Neurology 83(6), 529–536 (2014)
Legendre, C.M., Licht, C., Muus, P., Greenbaum, L.A., Babu, S., Bedrosian, C., et al.: Terminal complement inhibitor eculizumab in atypical hemolytic-uremic syndrome. N. Engl. J. Med. 368(23), 2169–2181 (2013)
Lenert, L.A., Cher, D.J., Goldstein, M.K., Bergen, M.R., Garber, A.: The effect of search procedures on utility elicitations. Med. Decis. Making 18(1), 76–83 (1998)
Licht, C., Greenbaum, L.A., Muus, P., Babu, S., Bedrosian, C.L., Cohen, D.J., et al.: Efficacy and safety of eculizumab in atypical hemolytic uremic syndrome from 2-year extensions of phase 2 studies. Kidney Int. 87(5), 1061–1073 (2015)
Lloyd, A.J., Thompson, R., Gallop, K., Teynor, M.: Estimation of the quality of life benefits associated with treatment for spinal muscular atrophy. Clinicoecon. Outcomes Res. 11, 615–622 (2019)
Lloyd, A., Piglowska, N., Ciulla, T., Pitluck, S., Johnson, S., Buessing, M., et al.: Estimation of impact of RPE65-mediated inherited retinal disease on quality of life and the potential benefits of gene therapy. Br. J. Ophthalmol. 103(11), 1610–1614 (2019)
Lloyd, A., Price, D., Brown, R.: The impact of asthma exacerbations on health-related quality of life in moderate to severe asthma patients in the UK. Prim. Care Respir. J. 16(1), 22–27 (2007)
Longworth, L., Young, T., Buxton, M.J., Ratcliffe, J., Neuberger, J., Burroughs, A., et al.: Midterm cost-effectiveness of the liver transplantation program of England and wales for three disease groups. Liver Transpl. 9(12), 1295–1307 (2003)
López-Bastida, J., Peña-Longobardo, L.M., Aranda-Reneo, I., Tizzano, E., Sefton, M., et al.: Social/economic costs and health-related quality of life in patients with spinal muscular atrophy (SMA) in Spain. Orphanet J. Rare Dis. 12(1), 141 (2017)
Lumry, W.R., Craig, T., Zuraw, B., Longhurst, H., Baker, J., Henry Li, H., et al.: Health-related quality of life with subcutaneous C1-inhibitor for prevention of attacks of hereditary angioedema. J. Allergy Clin. Immunol. Pract. 6(5), 1733–41.e3 (2018)
Magnetta, D.A., Kang, J., Wearden, P.D., Smith, K.J., Feingold, B.: Cost-effectiveness of ventricular assist device destination therapy for advanced heart failure in duchenne muscular dystrophy. Pediatr. Cardiol. 39(6), 1242–1248 (2018)
Martinez, F.J., de Andrade, J.A., Anstrom, K.J., King, T.E., Jr., Raghu, G.: Randomized trial of N-acetylcysteine in idiopathic pulmonary fibrosis. N. Engl. J. Med. 370(22), 2093–2101 (2014)
Mehta, A., Beck, M., Elliott, P., Giugliani, R., Linhart, A., Sunder-Plassmann, G., et al.: Enzyme replacement therapy with agalsidase alfa in patients with Fabry’s disease: an analysis of registry data. Lancet 374(9706), 1986–1996 (2009)
Miners, A.H., Holmes, A., Sherr, L., Jenkinson, C., MacDermot, K.D.: Assessment of health-related quality-of-life in males with Anderson Fabry disease before therapeutic intervention. Qual. Life Res. 11(2), 127–133 (2002)
Munzenberger, P.J., Van Wagnen, C.A., Abdulhamid, I., Walker, P.C.: Quality of life as a treatment outcome in patients with cystic fibrosis. Pharmacotherapy 19(4), 393–398 (1999)
Nordenfelt, P., Dawson, S., Wahlgren, C.F., Lindfors, A., Mallbris, L., Björkander, J.: Quantifying the burden of disease and perceived health state in patients with hereditary angioedema in Sweden. Allergy Asthma Proc. 35(2), 185–190 (2014)
Nordenfelt, P., Nilsson, M., Lindfors, A., Wahlgren, C.F., Björkander, J.: Health-related quality of life in relation to disease activity in adults with hereditary angioedema in Sweden. Allergy Asthma Proc. 38(6), 447–455 (2017)
Obici, L., Berk, J.L., González-Duarte, A., Coelho, T., Gillmore, J., Schmidt, H.H.J., et al.: Quality of life outcomes in APOLLO, the phase 3 trial of the RNAi therapeutic patisiran in patients with hereditary transthyretin-mediated amyloidosis. Amyloid 27(3), 153–162 (2020)
Orenstein, D.M., Kaplan, R.M.: Measuring the quality of well-being in cystic fibrosis and lung transplantation. The importance of the area under the curve. Chest 100(4), 1016–1018 (1991)
Orenstein, D.M., Pattishall, E.N., Nixon, P.A., Ross, E.A., Kaplan, R.M.: Quality of well-being before and after antibiotic treatment of pulmonary exacerbation in patients with cystic fibrosis. Chest 98(5), 1081–1084 (1990)
Orenstein, D.M., Nixon, P.A., Ross, E.A., Kaplan, R.M.: The quality of well-being in cystic fibrosis. Chest 95(2), 344–347 (1989)
Pangalila, R.F., van den Bos, G.A.M., Stam, H.J., van Exel, N.J.A., Brouwer, W.B.F., Roebroeck, M.E.: Subjective caregiver burden of parents of adults with Duchenne muscular dystrophy. Disabil. Rehabil. 34(12), 988–996 (2012)
Péntek, M., Gulácsi, L., Brodszky, V., Baji, P., Boncz, I., Pogány, G., et al.: Social/economic costs and health-related quality of life of mucopolysaccharidosis patients and their caregivers in Europe. Eur. J. Health Econ. 17(Suppl 1), 89–98 (2016)
Petrou, S., Kupek, E.: Estimating preference-based health utilities index mark 3 utility scores for childhood conditions in England and Scotland. Med. Decis. Making 29(3), 291–303 (2009)
Pintos-Morell, G., Blasco-Alonso, J., Couce, M.L., Gutiérrez-Solana, L.G., Guillén-Navarro, E., O’ Callaghan, M., et al.: Elosulfase alfa for mucopolysaccharidosis type IVA: real-world experience in 7 patients from the Spanish Morquio-A early access program. Mol. Genet. Metab. Rep. 15, 116–120 (2018)
Porte, F., Cottin, V., Catella, L., Luciani, L., Le Lay, K., Bénard, S.: Health economic evaluation in idiopathic pulmonary fibrosis in France. Curr. Med. Res. Opin. 34(10), 1731–1740 (2018)
Ramaswami, U., Stull, D.E., Parini, R., Pintos-Morell, G., Whybra, C., Kalkum, G., et al.: Measuring patient experiences in Fabry disease: validation of the Fabry-specific pediatric health and pain questionnaire (FPHPQ). Health Qual. Life Outcomes 10, 116 (2012)
Ramsey, S.D., Patrick, D.L., Albert, R.K., Larson, E.B., Wood, D.E., Raghu, G.: The cost-effectiveness of lung transplantation. A pilot study. Chest 108(6), 1594–1601 (1995)
Rice, S., Albani, V., Minos, D., Fattakhova, G., Mells, G.F., Carbone, M., et al.: Effects of primary biliary cholangitis on quality of life and health care costs in the United Kingdom. Clin. Gastroenterol. Hepatol. 19(4), 768-776.e10 (2021)
Rombach, S.M., Hollak, C.E.M., Linthorst, G.E., Dijkgraaf, M.G.W.: Cost-effectiveness of enzyme replacement therapy for Fabry disease. Orphanet J. Rare Dis. 8, 29 (2013)
Salisbury, M.L., Conoscenti, C.S., Culver, D.A., Yow, E., Neely, M.L., Bender, S., et al.: Antifibrotic drug use in patients with idiopathic pulmonary fibrosis. Data from the IPF-PRO registry. Ann. Am. Thorac. Soc. 17(11), 1413–1423 (2020)
Santana, M.J., Feeny, D., Ghosh, S., Lien, D.C.: Patient-reported outcome 2 years after lung transplantation: does the underlying diagnosis matter? Patient Relat. Outcome Meas. 3, 79–84 (2012)
Selvadurai, H.C., Blimkie, C.J., Meyers, N., Mellis, C.M., Cooper, P.J., Van Asperen, P.P.: Randomized controlled study of in-hospital exercise training programs in children with cystic fibrosis. Pediatr. Pulmonol. 33(3), 194–200 (2002)
Shulgina, L., Cahn, A.P., Chilvers, E.R., Parfrey, H., Clark, A.B., Wilson, E.C.F., et al.: Treating idiopathic pulmonary fibrosis with the addition of co-trimoxazole: a randomised controlled trial. Thorax 68(2), 155–162 (2013)
Singer, J.P., Katz, P.P., Soong, A., Shrestha, P., Huang, D., Ho, J., et al.: The effect of lung transplantation on health related quality of life in the era of the lung allocation score: a US prospective cohort study. Am. J. Transplant. 17(5), 1334–1345 (2017)
Singer, L.G., Chowdhury, N.A., Faughnan, M.E., Granton, J., Keshavjee, S., Marras, T.K., et al.: Effects of recipient age and diagnosis on health-related quality-of-life benefit of lung transplantation. Am. J. Respir. Crit. Care Med. 192(8), 965–973 (2015)
Smith, A.B., Retzler, J., Taylor, M.J.: Standard gamble to derive utility health states for limbal stem cell deficiency. Clinicoecon. Outcomes Res. 12, 535–546 (2020)
Solem, C.T., Vera-Llonch, M., Liu, S., Botteman, M., Castiglione, B.: Impact of pulmonary exacerbations and lung function on generic health-related quality of life in patients with cystic fibrosis. Health Qual. Life Outcomes 14, 63 (2016)
Sundh, J., Bornefalk, H., Sköld, C.M., Janson, C., Blomberg, A., Sandberg, J., et al.: Clinical validation of the Swedish version of Dyspnoea-12 instrument in outpatients with cardiorespiratory disease. BMJ Open Resp. Res. 6(1), e000418 (2019)
Suri, R., Metcalfe, C., Lees, B., Grieve, R., Flather, M., Normand, C., et al.: Comparison of hypertonic saline and alternate-day or daily recombinant human deoxyribonuclease in children with cystic fibrosis: a randomised trial. Lancet 358(9290), 1316–1321 (2001)
Szentes, B.L., Kreuter, M., Bahmer, T., Birring, S.S., Claussen, M., Waelscher, J., et al.: Quality of life assessment in interstitial lung diseases: a comparison of the disease specific K-BILD with the generic EQ-5D-5L. Respir. Res. 19(1), 101 (2018)
Tsai, A.P.Y., Hur, S.A., Wong, A., Safavi, M., Assayag, D., Johannson, K.A., et al.: Minimum important difference of the EQ-5D-5L and EQ-VAS in fibrotic interstitial lung disease. Thorax 76(1), 37–43 (2021)
Tsuboi, K., Yamamoto, H.: Clinical observation of patients with Fabry disease after switching from agalsidase beta (Fabrazyme) to agalsidase alfa (Replagal). Genet. Med. 14(9), 779–786 (2012)
Tzanakis, N., Samiou, M., Lambiri, I., Antoniou, K., Siafakas, N., Bouros, D.: Evaluation of health-related quality-of-life and dyspnea scales in patients with idiopathic pulmonary fibrosis. Correlation with pulmonary function tests. Eur. J. Intern. Med. 16(2), 105–112 (2005)
Van der Ploeg, C.P.B., Blom, M., Bredius, R.G.M., van der Burg, M., Schielen, P.C.J.I., Verkerk, P.H., et al.: Cost-effectiveness of newborn screening for severe combined immunodeficiency. Eur. J. Pediatr. 178(5), 721–729 (2019)
van Dussen, L., Biegstraaten, M., Hollak, C.E.M., Dijkgraaf, M.G.W.: Cost-effectiveness of enzyme replacement therapy for type 1 Gaucher disease. Orphanet J. Rare Dis. 9, 51 (2014)
Vasiliadis, H.M., Collet, J.P., Penrod, J.R., Ferraro, P., Poirier, C.: A cost-effectiveness and cost-utility study of lung transplantation. J. Heart Lung Transplant. 24(9), 1275–1283 (2005)
Wapenaar, M., Patel, A.S., Birring, S.S., van Domburg, R.T., Wp, B.E., Vindigni, V., et al.: Translation and validation of the king’s brief interstitial lung disease (K-BILD) questionnaire in French, Italian, Swedish, and Dutch. Chron. Respir. Dis. 14(2), 140–150 (2017)
Whiting, P., Al, M., Burgers, L., Westwood, M., Ryder, S., Hoogendoorn, M., et al.: Ivacaftor for the treatment of patients with cystic fibrosis and the G551D mutation: a systematic review and cost-effectiveness analysis. Health Technol. Assess. 18(18), 1–106 (2014)
Wixner, J., Mundayat, R., Karayal, O.N., Anan, I., Karling, P., Suhr, O.B., THAOS investigators: THAOS: gastrointestinal manifestations of transthyretin amyloidosis - common complications of a rare disease. Orphanet J. Rare Dis. 9, 61 (2014)
Wyatt, K., Henley, W., Anderson, L., Anderson, R., Nikolaou, V., Stein, K.: The effectiveness and cost-effectiveness of enzyme and substrate replacement therapies: a longitudinal cohort study of people with lysosomal storage disorders. Health Technol. Assess. 16(39), 1–543 (2012)
Yi, M.S., Britto, M.T., Wilmott, R.W., Kotagal, U.R., Eckman, M.H., Nielson, D.W., et al.: Health values of adolescents with cystic fibrosis. J. Pediatr. 142(2), 133–140 (2003)
Younossi, Z.M., Boparai, N., McCormick, M., Price, L.L., Guyatt, G.: Assessment of utilities and health-related quality of life in patients with chronic liver disease. Am. J. Gastroenterol. 96(2), 579–583 (2001)
Zisman, D.A., Schwarz, M., Anstrom, K.J., Collard, H.R., Flaherty, K.R., Hunninghake, G.W., Idiopathic Pulmonary Fibrosis Clinical Research: A controlled trial of sildenafil in advanced idiopathic pulmonary fibrosis. N Engl J Med. 363(7), 620–628 (2010)
Zuluaga-Sanchez, S., Teynor, M., Knight, C., Thompson, R., Lundqvist, T., Ekelund, M., et al.: Cost effectiveness of nusinersen in the treatment of patients with infantile onset and later onset spinal muscular atrophy in Sweden. Pharmacoeconomics 37(6), 845–865 (2019)
Beusterien, K.M., Davies, J., Leach, M., Meiklejohn, D., Grinspan, J.L., O’Toole, A., et al.: Population preference values for treatment outcomes in chronic lymphocytic leukaemia: a cross-sectional utility study. Health Qual. Life Outcomes 8, 50 (2010)
Tolley, K., Goad, C., Yi, Y., Maroudas, P., Haiderali, A., Thompson, G.: Utility elicitation study in the UK general public for late-stage chronic lymphocytic leukaemia. Eur. J. Health Econ. 14(5), 749–759 (2013)
Büchter, R.B., Weise, A., Pieper, D.: Development, testing and use of data extraction forms in systematic reviews: a review of methodological guidance. BMC Med. Res. Methodol. 20(1), 259 (2020)
Kennedy-Martin, M., Slaap, B., Herdman, M., van Reenen, M., Kennedy-Martin, T., Greiner, W., et al.: Which multi-attribute utility instruments are recommended for use in cost-utility analysis? A review of national health technology assessment (HTA) guidelines. Eur. J. Health Econ. 21(8), 1245–1257 (2020)
Acknowledgements
The authors are grateful to Dr Karen Facey and Dr Amanda Whittal for providing useful comments on this manuscript.
Funding
Open access funding provided by Università Commerciale Luigi Bocconi within the CRUI-CARE Agreement. This research was funded by the European Commission’s Horizon 2020 research and innovation programme and was undertaken under the auspices of IMPACT-HTA (Grant number 779312). The results presented here reflect the authors’ views and not the views of the European Commission. The European Commission is not liable for any use of the information communicated.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
MM has no conflict of interest. EN reports personal fees from Dolon Ltd outside the submitted work and has no conflict of interest. MD has conducted research and consulting on rare diseases outside the submitted work.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Meregaglia, M., Nicod, E. & Drummond, M. The estimation of health state utility values in rare diseases: do the approaches in submissions for NICE technology appraisals reflect the existing literature? A scoping review. Eur J Health Econ 24, 1151–1216 (2023). https://doi.org/10.1007/s10198-022-01541-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10198-022-01541-y