Abstract
Background and Aims
Patient-reported outcome measures (PROMs) for children (aged ≤ 18 years) present methodological challenges. PROMs can be categorised by their diverse underlying conceptual bases, including functional, disability and health (FDH) status; quality of life (QoL); and health-related quality of life (HRQoL). Some PROMs are designed to be accompanied by preference weights. PROMs should account for childhood developmental differences by incorporating age-appropriate health/QoL domains, guidance on respondent type(s) and design. This systematic review aims to identify generic multidimensional childhood PROMs and synthesise their characteristics by conceptual basis, target age, measurement considerations, and the preference-based value sets that accompany them.
Methods
The study protocol was registered in the Prospective Register of Systematic Reviews (CRD42021230833), and reporting followed Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines. We conducted systematic database searches for generic multidimensional childhood PROMs covering the period 2012–2020, which we combined with published PROMs identified by an earlier systematic review that covered the period 1992–2011. A second systematic database search identified preference-based value sets for generic multidimensional PROMs. The PROMs were categorised by conceptual basis (FDH status, QoL and HRQoL) and by target age (namely infants and pre-schoolers aged < 5 years, pre-adolescents aged 5–11, adolescents aged 12–18 and multi-age group coverage). Descriptive statistics assessed how PROM characteristics (domain coverage, respondent type and design) varied by conceptual basis and age categories. Involvement of children in PROM development and testing was assessed to understand content validity. Characteristics of value sets available for the childhood generic multidimensional PROMs were identified and compared.
Results
We identified 89 PROMs, including 110 versions: 52 FDH, 29 QoL, 12 HRQoL, nine QoL-FDH and eight HRQoL-FDH measures; 20 targeted infants and pre-schoolers, 29 pre-adolescents, 24 adolescents and 37 for multiple age groups. Domain coverage demonstrated development trajectories from observable FDH aspects in infancy through to personal independence and relationships during adolescence. PROMs targeting younger children relied more on informant report, were shorter and had fewer ordinal scale points. One-third of PROMs were developed following qualitative research or surveys with children or parents for concept elicitation. There were 21 preference-based value sets developed by 19 studies of ten generic multidimensional childhood PROMs: seven were based on adolescents’ stated preferences, seven were from adults from the perspective of or on behalf of the child, and seven were from adults adopting an adult’s perspective. Diverse preference elicitation methods were used to elicit values. Practices with respect to anchoring values on the utility scale also varied considerably. The range and distribution of values reflect these differences, resulting in value sets with notably different properties.
Conclusion
Identification and categorisation of generic multidimensional childhood PROMs and value sets by this review can aid the development, selection and interpretation of appropriate measures for clinical and population research and cost-effectiveness-based decision-making.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Our systematic review identified 110 versions of generic multidimensional patient-reported outcome measures (PROMs) for children (aged ≤ 18 years) spanning childhood age groups and conceptual bases of functional, disability and health status, quality of life and health-related quality of life. |
A supplementary systematic review identified 21 preference-based value sets for ten PROMs designed to be accompanied by preference-based value sets. |
Our catalogues of (1) PROMs categorised by target age group, conceptual bases and related characteristics (domain coverage, respondent type and design) and (2) value sets appraised for their development and statistical features can aid the development, selection and interpretation of appropriate childhood PROMs for clinical and population research and cost-effectiveness-based decision-making. |
1 Background
Patient-reported outcome measures (PROMs) enable direct assessment of health status, health-related quality of life (HRQoL) or quality of life (QoL) by patients or individuals [1,2,3]. They are used extensively as outcome instruments and can support patient-centred care [2, 4]. The use of PROMs in childhood populations (aged ≤ 18 years) presents methodological challenges compared to application in adults. One challenge is to account for age-based biopsychosocial developmental differences between children and adults and across children of different ages [5,6,7]. Distinguishing the age or age group of the target childhood population is important for content validity or concept coverage of the childhood PROM, its capacity for child self-report, appropriate design, and the culturally appropriate age for child self-report [2].
There are many ways that PROMs can be categorised. They can be divided into generic measures and disease or condition-specific measures, and further into domain-specific measures and multidimensional measures [4, 8]. Generic measures have the advantage of allowing comparisons across conditions and interventions. Multidimensional measures can be both generic and condition-specific and capture the multifaceted nature of concepts including health, QoL or HRQoL. PROMs can thus be further categorised by the concept(s) they seek to measure through their descriptive systems. There are multiple definitions of these concepts in the literature, with terms often used interchangeably [3, 9]. In this paper, we draw on the categories arising from a synthesis of definitions of functioning, disability, health, HRQoL and QoL used by the World Health Organization (WHO). In this taxonomy, a QoL/HRQoL measure can be distinguished from a functional, disability and health (FDH) measure by its reflection of the individual respondent’s perception, or subjective judgement of importance, of their assessed status [3]. Specifically, an FDH measure captures the ‘interactions among body structures and function, and activities and participation in the context of the environment and personal factors’, whilst QoL is ‘a person’s perception of [his/her] position in life’, where perception involves a subjective judgement over how the position relates to his/her goals, expectations, standards, enjoyment or concerns and not just a self-report of the position. HRQoL represents the individual’s ‘perception of his/her health and health-related states’ where ‘health’ represents a narrower domain than ‘position in life’ (p. 1086) [3].
PROMs can be accompanied by preference-based value sets which produce an overall index when applied to the multidimensional states generated by the descriptive systems [8]. PROMs accompanied by preference-based value sets are often referred to as multi-attribute utility instruments in the literature [10]. The value sets accompanying these measures use stated preference methods including standard gamble (SG) and time trade-off (TTO) to trade-off between a health state and mortality risk or life expectancy, respectively [11]. A key aspect of these methods is their ability to yield values that are anchored on a scale with 0 = dead and 1 = full health, as required for the use of values for estimating quality-adjusted life years (QALYs) in cost-utility analysis. Negative values are theoretically possible and represent health states considered worse than being dead. Discrete choice experiments (DCE) can also be used to elicit stated preferences for value sets for PROMs, although the resulting values lie on a latent scale unless accompanied by additional methods to allow anchoring at dead = 0 [12]. Decision makers around the world have shown a preference for country-specific value sets reflecting underlying differences in preferences of different populations [13,14,15].
There are considerable methodological challenges that arise in valuing childhood PROMs. Among them, there is contention around whose preferences are relevant, with some instruments offering both adult- and child-elicited value sets [16]. Eliciting values from children themselves may help to ensure that health services are relevant to their needs [17, 18], particularly when adults’ and children’s preferences differ [19]. However, there are practical and ethical issues surrounding the age at which children can be asked to participate in stated preference studies, especially where these include tasks that involve trade-offs with life expectancy. Both SG and TTO may impose high cognitive and/or emotional burden on children relative to DCE or best–worst scaling (BWS) tasks [17]. Where values are used to inform the allocation of healthcare resources financed primarily from taxation, a common normative position is that these should be based on the preferences of the adult general public [12, 20]. However, valuing childhood PROMs using adults’ preferences also raises issues. For example, it is unclear whether the respondent should be asked to think about themselves as a child, their child or another child [20].
Several systematic reviews of generic multidimensional childhood PROMs have been published since 2000 [3, 4, 21,22,23,24,25,26]. The most recent, by Janssens et al., identified 63 unique versions of PROMs (including eight accompanied by value sets) published between 1992 and 2011 [4]. Their review tabulated the characteristics of PROMs, including target age range, respondent type, domain coverage and several design features (e.g. number of items, response options and recall period) [4]. That review also mapped the domains of the PROMs onto the WHO’s International Classification of Functioning, Disability and Health, Child and Youth version (ICF-CY) framework [27], but noted the shortcoming of conflating the domains of FDH and QoL/HRQoL measures that have a different conceptual basis. It did not distinguish between types of informant response as recommended by the International Society for Pharmacoeconomics and Outcomes Research (ISPOR) guideline, i.e. proxy if the informant infers the child’s perception of health/QoL states and observer if the informant focuses on observable traits without inferring perception [2]. Moreover, little attention was paid to differences in characteristics of measures across age groups, or the value set features. Fayed and colleagues distinguished between FDH and QoL/HRQoL measures; however, their search period was limited to 2004–2008 [3]. Cremeens et al. [21] and Grange et al. [24] limited their reviews to 3- to 8-year-olds and < 5 year olds, respectively, which hinders comparisons of PROM characteristics across all childhood age groups. Finally, Chen and Ratcliffe previously reported on the features of nine childhood PROMs accompanied by value sets, but this was not a systematic review [10]. These gaps in previous studies inform the aim and objectives of this systematic review.
2 Aim and Objectives
This systematic review aims to generate a comprehensive catalogue of generic multidimensional childhood PROMs that rely on child self- and/or informant-report (hereafter childhood PROMs for brevity in accordance with the ISPOR guideline [2]) and the value sets that accompany them. In doing so, we aim to inform the development, selection and interpretation of childhood PROMs for clinical and population research and cost-effectiveness-based decision-making. Accordingly, the objectives are to:
-
1.
Systematically identify generic multidimensional childhood PROMs and preference-based value sets that accompany them.
-
2.
Categorise the PROMs according to conceptual basis and target age group and evaluate their methodological features according to the ISPOR good research practice recommendations on the use of childhood PROMs [2].
-
3.
Catalogue the value sets that exist for childhood PROMs, compare the methods that have been used to produce them and report the characteristics of the resulting value sets.
3 Methods
A pre-specified protocol outlining the systematic review methods was developed and registered with the Prospective Register of Systematic Reviews (CRD42021230833). For reporting purposes, the review followed the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [28]. See the Supplementary Information for the PRISMA checklist for this paper (see the electronic supplementary material).
3.1 Data Sources and Study Selection
Two independent systematic database searches were conducted. The first aimed to identify primary development studies for generic multidimensional childhood PROMs published between 1 January 2012 and 16 October 2020. Studies identified in the search were pooled with those published between 1 January 1992 and 20 March 2012 identified in the earlier systematic review by Janssens et al. [4]. A second search was conducted without date limits on 18 February 2021 to identify preference-based value sets for generic multidimensional childhood PROMs. This was constructed around measures identified by the first search that reported the development of a value set or the objective to develop a value set or to inform economic evaluations. Developers of PROMs were also contacted to confirm whether, to the best of their knowledge, there was no missing published value set.
Both systematic searches covered seven academic databases (Medline, Embase, PsycInfo via Ovid, EconLit via Proquest, CINAHL via EBSCOHost, Scopus and Web of Science) and one grey literature database (PROQOLID). The Medline search strategy for the first and second searches are presented in Tables A and B in the Supplementary Information (see the electronic supplementary material). Searches were limited to English language texts. In both searches, references were imported into Endnote X9 and duplicates were removed. References were then exported into Covidence [29], where two researchers independently reviewed the titles and abstracts of identified articles. If an article received two approvals, it proceeded to the next stage, with disagreements referred to a third reviewer for the final assessment. Full text articles were independently reviewed by two reviewers with disagreements again referred to the third reviewer for final assessments.
The inclusion criteria for the systematic search of PROMs were as follows: studies reporting the development of a generic multidimensional childhood (aged ≤ 18 years) PROM and studies published in the English language. Conference abstracts with sufficient information (i.e. containing at least the acronym/name of the instrument(s), the aim/motivation of development, whether a general childhood population was targeted, and the multidimensionality of the instrument) were included alongside full text articles. Studies in adults only and animals were excluded, as were commentaries, single case reports and secondary application of PROMs. References of included studies were searched, including those published before 1 January 2012. The inclusion criteria for the systematic search to identify value sets were as follows: studies reporting the development of a value set for one or more generic multidimensional childhood PROM; the value set was anchored on a 0–1 scale required for the use of the values in QALY estimation; and studies (including full text articles and a user’s manual [no conference abstracts were identified]) published in the English language. Studies that only reported values for a sub-set of the health states described by a childhood PROM were excluded.
3.2 Data Extraction
For the first review that identified generic multidimensional childhood PROMs, data for each measure were extracted independently by two reviewers per article from a pool of eight reviewers, with disagreements resolved through discussion. For the second review that identified preference-based value sets, data for each value set were extracted by one reviewer, with a 20% check performed by a second reviewer. For the identified generic multidimensional childhood PROMs, the following characteristics were extracted according to proformas in Excel: acronym/name of the instrument, name of author(s)/developer(s), development year and country(ies), aim/motivation of development, target age range, background concept (e.g. definition of health/QoL adopted before commencing PROM development), development methods (e.g. focus groups with children for item selection), role of children in development, level of child’s perception captured, respondent type (e.g. child, proxy, observer), administration mode (e.g. by self, by interviewer), recall period, domains covered, number of items, response options and method for score generation (other than value set application).
For the identified value sets of the generic multidimensional childhood PROMs, the following characteristics were extracted: acronym/name of the instrument, name of author(s)/developer(s), valuation year and country(ies), methods for selecting valued health states, number of valued health states, sampling methods for valuation including target population, recruitment strategies, response rates, sample size, representativeness of sample and reasons for exclusion, and stated preference data collection methods. The latter included elicitation technique(s) (e.g. SG, TTO, DCE), the approach taken to anchoring at 0–1, characteristics of respondents, perspective and administration mode, and modelling methods. Finally, the properties of the value sets were examined (e.g. value range, proportion of health states with negative values). As some of these properties were not reported in the papers, we coded the value set algorithms based on the published results and calculated the index scores for all potential health states according to the classification system of the instruments.
3.3 Data Synthesis
Using a method similar to that of Fayed et al. [3], the generic multidimensional childhood PROMs identified were categorised by conceptual basis. The wording and response options for measure items were reviewed independently by two reviewers per article from a pool of eight reviewers. Measures with 75% or more of items that captured a child’s perception (i.e. relation to enjoyment, satisfaction, goals, expectations, standards or concerns) on health or broader position in life were labelled HRQoL or QoL measures, respectively. Measures with 25% or fewer items capturing a child’s perception were labelled FDH measures. Those with between 25 and 75% of items capturing a child’s perception were labelled hybrid HRQoL-FDH or QoL-FDH measures. Fayed and colleagues similarly described the hybrid nature of many measures: e.g. the General Health Questionnaire was classified as ‘FDH and QoL/HRQoL’, the HUI as ‘FDH (with one HRQoL/QoL attribute)’ and the KIDSCREEN as ‘HRQoL/QoL (with some functioning features)’. Instead of these mixed descriptions, this study specifies cut-off levels to allow for comparison across a wide range of measures.
Distinguishing between QoL and HRQoL was particularly challenging given the overlap between health and position in life. Indeed, Fayed et al. do not make this distinction in their results, grouping QoL and HRQoL under the same category [3]. A two-step pragmatic approach was taken: (1) refer to the stated aim/motivation of the study prior to measure development or use (e.g. some studies gave the definition of QoL or HRQoL and underlying constructs [30, 31]), and if still unclear, then (2) categorise as HRQoL measures if more than 50% of the domains cover body functioning, disabilities and daily activities (e.g. mobility, pain, vision, anxiety, depressive symptoms, chronic illness, dressing and grooming) that would constitute the concept of FDH as defined by Fayed et al. [3].
When step (1) in this pragmatic approach could not be applied (i.e. the study used conceptual labels without clearly defining them), we did not automatically apply the conceptual labels given to PROMs by the developer(s). Fayed et al. documented the discrepancies between the conceptual bases as labelled by developers and secondary applications of PROMs and those as classified by their review. For example, Fayed et al. noted that the HUI and the PedsQL were labelled as HRQoL/QoL measures in the primary and secondary literature but are more accurately labelled FDH measures [3]. In the current review, conceptual bases were similarly assigned according to extracted data rather than existing labels (unless clearly defined), but we did not document the discrepancies as was done by Fayed et al.
For measures that use informant responses, the informant type was classified according to the ISPOR guideline [2], namely proxy for QoL/HRQoL measures (since informants infer the child’s subjective perception) and observer for FDH measures (since no such inference is made). For hybrid QoL/HRQoL-FDH measures, the informant type was labelled hybrid proxy-observer.
Following the ISPOR good research practices guideline [2], we explored differences in measurement characteristics for PROMs targeted at different age groups. The ISPOR guideline’s age cut-offs were used to categorise the measures by their target age group: less than 5 years, covering infants, toddlers or pre-schoolers; 5–7 years for younger pre-adolescents; 8–11 years for older pre-adolescents; and 12–18 years for adolescents. We then explored how the characteristics of the measures (including range of domains covered, respondent and informant type, and other design features) varied by conceptual basis and target age. In cases where target ages were not clearly specified, inferences were made from the study aim and development methods to assign the target age category.
The level of content validity of each PROM was assessed according to whether children of the target age were involved in (1) the development of the measure, including qualitative research for domain and item elicitation, and (2) assessment of comprehension using cognitive interviews and/or pilot/feasibility studies. Note that appraisals of other psychometric properties such as construct validity and internal consistency are not reported here but in forthcoming work. Cultural issues present in the initial instrument development (as opposed to subsequent translations and secondary cross-cultural adaptations of the developed instrument) were similarly described.
For the review of value sets for generic multidimensional childhood PROMs, data were analysed using narrative synthesis. Summary tables were used to describe the bibliography/setting, sampling methods, preference data collection methods and statistical features of the identified value sets. Separate summaries of the modelling methods are presented for two categories of preference elicitation techniques: TTO, SG and rating scales (RS); DCEs and BWS.
4 Results
4.1 Search Results
Figure 1 shows the PRISMA flow diagram for the first search that identified primary studies developing generic multidimensional childhood PROMs published between 1 January 2012 and 16 October 2020.
Thirty-four eligible studies were identified that described 26 PROMs. Sixty-three PROMs included in the review by Janssens et al. [4] that met the inclusion criteria were included to give 89 PROMs in total. Fourteen PROMs were accompanied or designed to be accompanied by preference-based value sets. Figure 2 shows the PRISMA flow diagram for the identification of value sets for these 14 PROMs. Nineteen studies were identified in total that described 21 value sets.
4.2 Overview of Generic Multidimensional Childhood Patient-Reported Outcome Measures (PROMs)
Table 1 provides an overview of the characteristics of the identified generic multidimensional childhood PROMs that are not accompanied by or designed to be accompanied by preference-based value sets, while Table 2 does so for PROMs designed to be accompanied by preference-based value sets. Some measures have several versions, and Table 1 includes all distinct versions, providing a total of 110 unique measures (for example, the PedsQL 4.0 Generic Core Scales are composed of age-specific modules for toddlers, young children, children and teens [32]). Of the 110 measures, 52 are FDH measures, 29 are QoL measures, 12 are HRQoL measures, nine are hybrid QoL-FDH measures, and eight are hybrid HRQoL-FDH measures. Seventeen measures (reflecting two versions each for HUI2 and HUI3 and two variants of the EQ-5D-Y) are designed to be accompanied by preference-based value sets. The measures were primarily developed in high-income countries (using the 2021 World Bank classifications [33]), with only seven (6%) developed in lower- or middle-income country (LMIC) settings as part of an international development process [34,35,36,37,38,39,40].
4.3 Characteristics of Measure by Age Category
Age groups 5–7 and 8–11 were combined for the subsequent descriptions of measure characteristics because relatively few measures targeted these age groups. This generated three rather than four age categories for the subsequent descriptions. Overall, there were 20 measures that targeted children less than 5 years, 29 that targeted children aged 5–11 years and 24 that targeted adolescents aged 12–18 years. Thirty-seven measures covered multiple age groups. Table C in the Supplementary Information (see the electronic supplementary material) organises the included measures by the above four target age group categories.
4.3.1 Conceptual Basis
Figure 3 shows the number of identified PROMs by age category according to their conceptual basis. The proportion of measures designed to elicit the child’s perception of their status (i.e. QoL, HRQoL and QoL/HRQoL-FDH measures) varied between four out of 20 (20%) for the infants, toddlers or pre-schoolers category and 18 out of 24 (75%) for adolescents. Twenty of 37 (54%) measures with multi-age group coverage were designed to elicit the child’s perception of their status.
4.3.2 Domain Coverage
Table 3 lists in alphabetical order the domains covered by generic multidimensional childhood PROMs stratified by age category and conceptual basis. Moreover, domains that are unique to each age category are presented in bold and underlined.
Thirty-one domains were unique to infants, toddlers or pre-schoolers, of which 25 were unique to FDH measures and six to HRQoL measures. There were ten unique domains for pre-adolescents, 25 for adolescents and 16 for the multi-age coverage category. The unique domains were concentrated in FDH measures (19 of 25; 76%) for infants, toddlers or pre-schoolers, and in QoL measures (16 of 25; 64%) for adolescents.
4.3.3 Respondent/Informant Type and Other Measure Characteristics
Table 4 summarises the respondent and informant types and design features of measures by age category.
Measures targeting younger age groups were more likely to rely on informant report. For example, of the 25 measures compatible with observer report only, 16 (64%) targeted children aged less than 5, and none targeted adolescents. Of the 44 measures designed primarily for child report, 18 (41%) targeted adolescents, and one (2%) targeted children aged less than 5.
Of 76 measures compatible with child self-report, 49 (65%) were designed for self-administration using non-electronic or electronic modes of data collection. Overall, there were 14 measures (13%) that used electronic data collection from children or informants: seven using computers at study locations [41,42,43,44,45,46,47], four using tablets or mobiles [48,49,50,51] and three via the internet from non-study locations [52, 53].
Shorter recall periods were used for younger age groups. Of 13 measures compatible with child self-report that had a recall period of current/today, seven (54%) targeted children aged 5–11, while two (15%) targeted adolescents.
The numbers of domains and items tended to be lower for measures targeting younger age groups. Of 42 measures compatible with child self-report that had more than five domains, 13 (31%) targeted adolescents and eight (19%) targeted children aged 5–11. Of 28 measures compatible with child self-report that had more than 30 items, ten (36%) targeted adolescents and six (21%) targeted children aged 5–11. Two measures used computerized adaptive tests [46, 47] that tailored the number of items according to respondents’ previous answers.
Most measures (102 of 110; 93%) used some form of ordinal scale with a pre-defined number of response points (e.g. Likert scale). Six measures used a visual analogue scale (VAS) alongside a Likert scale. Eleven measures used pictorial or narrative aids for child self-reports.
The vast majority of measures without value sets (78 of 93; 84%) used unweighted sums, averages or proportion of maximum scores as scoring methods. Four measures without value sets elicited importance scores from children, with items weighted to produce the overall QoL score [30, 49, 54, 55]. The KIDSCREEN and PROMIS groups of measures used item response theory scoring [44,45,46,47, 56,57,58,59].
4.4 Content Validity of Measures
Table D in the Supplementary Information (see the electronic supplementary material) describes the background concepts and development methods for measures, including whether and how children (or parents/primary caregivers in case of infants, toddlers or pre-schoolers) were involved in the development and pilot-testing processes. The development of 21 of the 110 measures (19%) involved qualitative research with or surveys of children for domain and item elicitation. Nine measures were based on adapting existing adult measures: one conducted focus groups with adolescents [37]; two cognitive interviews [6, 38]; one feasibility testing [60]; and five made little or no mention of children’s involvement [54, 61,62,63,64]. Twenty-three measures reported no involvement of children, relying on clinical and/or research expertise for domain/item elicitation, using statistical techniques for item selection (e.g. for short-form development), or provided little detail on development. Among measures with multi-age group coverage, nine of the 37 measures (24%) conducted qualitative research with children.
Seven measures addressed cultural issues in initial measure development. The ACHWM aimed to develop a culturally appropriate model of health and wellbeing for Aboriginal communities in Canada [48]. The QOLQA and QOLQA-Taiwan aimed to develop measures appropriate for Asian adolescents [35, 65]. The development of the IQI involved interviews with parents of infants aged 0–3 years from New Zealand, Singapore and the UK to assess cross-cultural interpretability [51]. Izutsu et al. adapted the existing English version of the WHOQOL-BREF for Bangladeshi adolescents and identified the need for interviewer administration due to low literacy rates in this context [36]. Finally, the KIDSCREEN measures were developed across 13 European countries specifically for cross-cultural relevance [57]; while the development of the EQ-5D-Y similarly included seven countries [38].
4.5 Characteristics of Preference-Based Value Sets for Childhood PROMs
The characteristics of 21 preference-based value sets developed by 19 studies of ten generic multidimensional childhood PROMs are summarised in Table 5. The CH-6D, CHSCS-PS, EQ-5D-Y-5L and TANDI, which have been designed as generic multidimensional childhood PROMs with the accompaniment of a value set as an objective, did not have value sets publicly available at the time of the review. The 21 value sets were predominately developed in high-income countries with the exception of those developed for Fiji and Tonga (AQoL-6D Adolescents) [37] and China (CHU9D) [66]. BWS and DCEs increased in frequency of use in recent valuation studies, with TTO and SG techniques largely utilised at least a decade ago, unless used for the specific purpose of anchoring the DCE/BWS to the QALY scale. The respondent types from which the value sets were derived differ across measures, ranging from school children and students (16D, AQoL-6D, CHU9D [Australia and China]) through to parents/caregivers (17D, HUI2, IQI), with general population adult samples forming the most prevalent source of values overall (AHUHM, CHU9D [UK and The Netherlands], EQ-5D-Y, HUI2, HUI3, QWB, IQI).
The size of the samples from which the value sets were derived differed markedly between valuation studies, ranging from 115 (17D, parents of children [67]) to 4155 (EQ-5D-Y US value set, adults [68]). Some studies conducted complementary TTO experiments to assist with anchoring [66, 69,70,71] or used a particular type of DCE that includes a life year attribute (which is commonly referred to as DCETTO) [72] to assist with anchoring. Studies using BWS and DCE generally used larger sample sizes than more traditional approaches to health state valuation such as RS, TTO and SG (supplementary Figure S1, see the electronic supplementary material) [73]. Studies employing DCE and BWS approaches in relatively large samples also tended to be administered in self-complete online surveys. In contrast, TTO and SG were more likely to be administered in smaller samples and in an interview format due to their greater complexity and iterative nature [73].
Seven value sets were elicited from adolescents using RS for the 16D [18], TTO for AQoL-6D Adolescents (four value sets) [37] and BWS for CHU9D [66, 74]. One study elicited values from young adults to anchor the Ratcliffe 2015 value set for the CHU9D [74]. Amongst the remaining studies that sampled adult populations, five studies eliciting country-specific value sets for the EQ-5D-Y or HUI2 asked adult respondents to express their preferences from the perspective of a child aged 7 and 10 years [68] or 10 years [70, 71, 75, 76]. The HUI2 value sets [75, 76] added additional context by specifically asking the adult to imagine the child living in the health state for the remainder of their life expectancy, with death at age 70. Two value sets were elicited from adults on behalf of the child [67, 77]. The remaining seven studies [72, 78,79,80,81,82,83] elicited health state preferences from adults who were asked to value health states from the perspective of their own health.
There were noticeable variations in the features of the value sets depending on the setting and valuation protocols adopted. For example, there are several value sets for the CHU9D, including the UK general adult population value set based on SG [79], the Dutch general adult population value set based on the DCETTO [72] and Australian [74] and Chinese [66] adolescent value sets based on BWS. Variation in the rank order of the most important domains is evident and illustrated in Table 5, with adolescents in China ranking the ability to join in activities and tired domains as most important, adolescents in Australia placing greater importance on mental health domains (e.g. sad, annoyed) and adults in both the UK and the Netherlands placing greater importance on physical health domains (e.g. pain and sleep). However, these differences may be due to the differences in valuation protocol, the sample characteristics and differences in preferences between countries
The range of health state values and the percentage of negative health state values (health states considered worse than being dead) differs across value sets within measures (Fig. 4 shows the value range by perspective, and supplementary Figure S2 shows the range by elicitation method). For example, the worst health state described by the CHU9D had a value of 0.337 in the UK general population value set, in contrast to the Australian [69], Chinese [66] and The Netherlands [72] value sets, where the worst health state had values of −0.1059, 0.0563 and −0.568, respectively. The UK general population and Chinese adolescent CHU9D value sets have no health states valued at worse than being dead, whereas the Australian adolescent and The Netherlands general population value sets have 0.19 and 5.17% of health states, respectively, valued at worse than being dead.
Similarly, when drawing comparisons between different instruments, marked differences are evident in the range of values for value sets and in the proportions of health states that are valued at worse than being dead, with most studies reporting no negative health state values [18, 37, 66, 67, 71, 77, 79, 83]. However, some studies report relatively high proportions of health states valued at worse than being dead: e.g. 21% for the Slovenian EQ-5D-Y value set [70], whilst the publicly available Canadian algorithm for the HUI3 suggests that as many as 78% of health states are valued as worse than being dead [80]. The latter’s value range and proportion of negative values is noticeably greater than all other value sets. A potential reason for the high proportion could be the multiplicative function used for HUI3 valuation tasks as compared to the more common additive functional form used by other measures (supplementary Figs. S3a and S3b). More broadly, differences in modelling approaches can be as important as differences in the stated-preference methods used in determining the overall characteristics of value sets. Supplementary Tables E and F contain further detail on the valuation tasks and modelling methods for studies using two classes of methods for value set development (RS, TTO and SG [Table E, see the electronic supplementary material]; and DCE and BWS [Table F, see the electronic supplementary material]).
5 Discussion
This systematic review has generated an up-to-date and comprehensive catalogue of generic multidimensional childhood PROMs and the value sets that accompany them. The outputs can be used to inform the development, selection and interpretation of measures and value sets across a range of research contexts, including research designed to inform patient-centred care and research designed to inform cost-effectiveness-based decision-making. Specifically, the description provided of PROM characteristics such as target age range and domain coverage—alongside evidence on their psychometric performance explored in a forthcoming systematic review—should aid decisions around measure selection and, where appropriate measures are currently not available, measure development. Likewise, the granulated descriptions and categorisations of measures and value sets should aid interpretation of the outputs produced by measures across a range of methodological characteristics.
The review is the first to comprehensively catalogue generic multidimensional childhood PROMs according to the conceptual basis using definitions identified and outlined by the author team. It hence builds on the previous work by Fayed et al., who highlighted ways in which the terms health status, QoL and HRQoL are often used interchangeably in the literature despite important conceptual differences [3]. This is similarly noted by Karimi and Brazier [9], although their suggestion that the term HRQoL be reserved only to describe health utilities does not distinguish between measurement (individual perception incorporated into the descriptive systems of HRQoL measures) and valuation (societal preferences used to derive value sets) considerations. The latter may be appropriate for informing societal resource allocation decisions, while the former is more suited to informing clinical decisions for individuals [84], although the dichotomy is not absolute. Therefore, appropriate selection of childhood PROMs for research and decision making necessitates careful consideration of the context within which they are to be used.
The ISPOR guidelines for application of childhood PROMs recommend end users consider the applicability of characteristics of measures across children of different ages [2]. For example, a key consideration is the health/QoL domain coverage of PROMs targeted at particular age groups. This review describes the methods adopted to elicit relevant domains and items from children of different ages or their parents/caregivers and updates the content validity assessment conducted by Janssens et al. [85]. Moreover, our review catalogues the resulting domains by conceptual basis and target age group, highlighting those that are unique to each category and aiding instrument selection by the end user’s target childhood population and domains of interest. Previous reviews have listed domain coverage, but only for individual instruments and not by conceptual and age categories [4, 10, 22, 25, 26]. The review identifies unique domains across developmental stages from infancy to adolescence, with a focus on observable FDH aspects (e.g. responsiveness, appetite) during the first 5 years of life replaced by increased focus on self-perception of activities and immediate relations during pre-adolescence (e.g. activity impairment, relation with siblings), followed by heightened awareness of personal independence and wider relations during adolescence (e.g. personal competence, opposite-sex relationships).
The emphasis on age-specific content should be balanced by the ability to generate outcomes that are comparable across childhood age groups and between children and adults; the latter is particularly important for research informing life-course healthcare decision-making [86]. The development of early childhood (age 1–5 years) versions of PROMIS sought to balance the two priorities by eliciting relevant domains and items from parents while minimising changes to the recall period and question wording of adult PROMIS measures [87]. Likewise, we identified nine measures that adapted existing adult measures, three after conducting qualitative research or cognitive interviews with children [6, 37, 38]. Significantly, two of these measures that were adapted with childhood input—QoL-C for age 4–9 years [6] and EQ-5D-Y for age 8–15 years [38]—are based on the EQ-5D, the reference case outcome measure for cost-utility analyses in the UK [13].
Another key age-dependent characteristic is the feasibility of child self-report, which is clearly dependent on not only chronological age, but also on children’s cognitive capacity and their educational abilities (reading, comprehension and writing skills). Unsurprisingly, only two of 20 measures targeting children aged less than 5 provide for self-report [88, 89], compared to 21 of 29 measures targeting pre-adolescents and all measures targeting adolescents. Where informant reports are used, this review distinguishes between proxy and observer report depending on the extent to which the informant is asked to infer a subjective perception over health/QoL items for the child. Hence, there is close overlap between the conceptual basis of the measure and informant type. This distinction is recommended by the ISPOR guideline [2], but is yet to be widely implemented. Janssens et al., for example, did not include the observer/proxy distinction in their discussion (p. 325) [4]. Moreover, based on empirical findings that child–informant agreement is poorer for subjective health/QoL constructs (e.g. emotion, pain) than observable ones (e.g. mobility, self-care) [90, 91], the ISPOR guideline recommends instruments involving observer report over those involving proxy report where informant report is found necessary (p. 470) [2]. Of the 61 identified measures compatible with informant report, 19 (31%) involved proxies; of the 33 measures compatible only with informant report (i.e. not designed for child self-report), nine (27%) involved proxies. This suggests that many existing measures do not adhere to the ISPOR recommendation, although more recently developed measures for infants and early childhood specifically cite this recommendation as the reason for including only observable domains [39, 87]. A key rationale for incorporating informant (specifically parent) report is that informants often have vital insights into the child’s health needs and medical history and hence this facilitates healthcare decision-making [2, 92]. Future research should examine to what extent this decision-making is impaired by limiting the informant focus to observable domains.
Incorporating age-appropriate instrument design is another ISPOR good research practice principle [2]. Electronic data collection is recommended as the preferred administration mode, particularly in pre-adolescent age groups where self-report is possible [2, 21, 93]. Accordingly, three measures specifically targeting pre-adolescents incorporated electronic data collection in self-reports [41, 42, 49] as did another seven with multi-age group coverage including pre-adolescents [43,44,45,46,47, 50, 53]. The ISPOR guideline notes the difficulties children face in comprehending extended periods and recommends a recall period of 24 h or less. A recent systematic review by Coombes et al. on the design of childhood PROMs in primary studies similarly recommends a recall period of 48 h or less for children aged 5–7 years [94]. These recommendations were applied (i.e. recall period of current/today or general) by half of the measures in our review that elicit pre-adolescent self-reports. The other half contains recall periods ranging from the past week to the past 2 months. Concerning instrument length, relatively few instrument developers were concerned about the administrative burden of long instruments and used this as a rationale for development of short forms [53, 55, 58, 95]. In all, the number of instruments adhering to design recommendations is generally low.
For childhood PROMs accompanied by preference-based value sets, a key challenge lies in how to elicit societal preference weights. This review reveals an increasing trend towards using ordinal approaches (e.g. DCE and BWS) in health state valuation. These approaches (in particular BWS) have different cognitive demands to iterative approaches and may be more suitable for children to self-complete [96]. Given death is normally not included in the hypothetical choice tasks, this also aligns with ethical considerations for children. It enables preferences of children to be directly elicited and has been adopted by the studies that derived value sets for the CHU9D in Australia and China [66, 74]. However, given the raw values from these ordinal tasks are estimated on a latent utility scale, there is a need to have a standalone valuation survey using traditional direct approaches to health state valuation (e.g. TTO or SG) with (normally) a small convenience sample to facilitate the re-scaling exercise. This enables the value sets to produce cardinal health state values compatible with the 0–1 (dead to full health) QALY scale (for a comprehensive overview, see [73, 97]). For the most recently published EQ-5D-Y value sets, a similar approach was used, but instead of using BWS, a DCE was adopted in the main valuation task [70, 71].
This review identified that the majority of value sets (14/21 value sets) were derived using adult samples; in half of these, the adult was asked to take the perspective of a child, and half were from the adult’s own perspective. Decisions about the appropriate perspective to adopt when developing value sets for economic evaluation need to consider the preference of health technology assessment (HTA) agencies or decision makers in each country. There are currently notable differences in the approaches taken for each country. For example, CHU9D value sets developed in the UK and The Netherlands are both based on the perspective and preference of the general public (aged ≥ 18 years) [72, 79]. The SG was used for the UK value set [79], whilst a DCETTO was used for The Netherlands value set [72]. The DCETTO does not require a separate task or data manipulation for states considered worse than dead [12, 98]; however, this approach was developed for adults, and may not be suitable for children and adolescents given ethical concerns about the inclusion of death. Notably, this issue also applies to the TTO. It will be important for future value set generation that the methods align to decision maker preferences as results will differ depending on approach taken. This also means that caution is required when comparing values generated across country value sets. Comparability of utilities for children with utilities generated for adults will also need to be considered by decision makers. The differences in perspectives used and normative judgments around appropriate perspectives will need to be balanced with considerations of comparability.
The review has also identified a trend over the last decade towards DCE and BWS exercises involving relatively larger samples and a reliance on online surveys. Whilst there are considerable practical advantages with online sampling, care is needed to ensure the quality of responses obtained, with protocols such as those developed by the EuroQol Group playing a critical role in quality assurance [99]. Qualitative studies such as ‘think aloud’ also provide information to understand participant decisions in the absence of a face-to-face interaction [100].
Across all categories of PROMs, regardless of the construct (or concept) being measured or target age(s), a noticeable feature is the scarcity of measures developed in LMIC settings (though this could be partly related to the reviews’ sole inclusion of English language studies). Only five LMICs were covered by PROM development and only three by value set development. A previous systematic review of studies applying generic childhood PROMs accompanied by value sets found a similar scarcity, with only 8% of samples coming from LMICs [16] despite the substantially higher childhood disease burdens in these settings [101]. Further development of PROMs and value sets relevant to LMICs is a pressing research need, particularly when many characteristics of PROMs are contingent upon local contexts (e.g. low literacy rates hindering self-administration [36]) and when national HTA agencies demand value sets derived from samples relevant to local decision-making [13].
5.1 Strengths and Limitations
The contributions and strengths of this systematic review relative to previous reviews of childhood PROMs have been discussed under specific themes above (e.g. categorising PROMs by conceptual basis and target age group). There are nonetheless limitations warranting further research as well as caveats that should be borne in mind by readers. First, in contrast to previous reviews [21, 22, 24,25,26, 102], this review did not assess the psychometric properties of identified PROMs. Assessment of psychometric properties is recommended for PROM selection in research and decision-making [2]. In common with the work conducted by Janssens et al. [85], a separate systematic search and synthesis of the psychometric properties of the PROMs identified by this review is planned. This review provides partial evidence of the content validity of PROMs (Table D, see the electronic supplementary material), which will be combined with evidence of other aspects of content validity such as relevance and comprehensibility, as well as evidence for other psychometric properties [103]. Second, the research is constrained by a lack of consensus surrounding terms that describe the conceptual bases underpinning PROMs [9]. Hence, the methods used in this work to establish the categories of FDH, QoL and HRQoL measures [3], and the cut-off levels for hybrid categories, may be disputed. That said, a sensitivity analysis that changed the cut-off levels required for the proportion of items within measures that captured a child’s perception for them to be labelled QoL/HRQoL-FDH hybrid measures (from 25–75% to 33–67%) altered the label of only one measure (Kiddy-KINDL parent questionnaire [89]), and this had minimal impact on subsequent analyses.
Third, this review covered generic multidimensional childhood PROMs only and did not cover multidimensional childhood PROMs developed for specific conditions. Future research could usefully generate a parallel catalogue of condition-specific multidimensional childhood PROMs to inform research in specific clinical areas [104]. Fourth, the delineation of age and age categories throughout the paper is framed in chronological terms. Consideration of developmental age or markers of reading or educational ability could inform the selection of multidimensional childhood PROMs in some contexts, but this would require analyses of individual-level data from studies that also report developmental or educational outcomes. A final caveat is that this review does not report overall scores on the methodological or reporting quality of included studies through, for example, the Checklist for Reporting Valuation Studies (CREATE) score for valuation studies [105]. This was because CREATE was not developed to encompass values for childhood PROMs and misses key additional elements of the methodological choices required in this context. For PROM development studies, the planned systematic review of psychometric properties of measures will apply the Consensus-based Standards for the selection of health status Measurement Instruments (COSMIN) checklist to assess their methodological quality [103].
6 Conclusion
The catalogue of generic multidimensional childhood PROMs generated by this systematic review creates a valuable resource for researchers, practitioners and policy makers in selecting the most appropriate measure(s) for application within childhood populations according to their conceptual basis, target age, design and needs of the end user. This information should be viewed in conjunction with evidence surrounding the psychometric performance of measures to be presented in the follow-up systematic review. The description of PROM characteristics should inform decisions about the need to develop measures with alternative features or targeted at particular age groups. Moreover, the identified value sets covering all childhood age groups can be used to inform cost-utility analyses of childhood interventions. However, many methodological questions remain regarding the methods to use in valuing childhood PROMs, and this is currently a very active area of research. The availability of childhood PROMs and the value sets that accompany them have important roles in informing, respectively, individualised care and cost-effectiveness-based decision-making.
Change history
14 February 2022
A Correction to this paper has been published: https://doi.org/10.1007/s40273-022-01135-9
References
Johnston BC, Patrick DL, Devji T, Maxwell LJ, Bingham III CO, Beaton D, et al. Patient-reported outcomes. In: Higgins JPT TJ, Chandler J, Cumpston M, Li T, Page MJ, Welch VA, editors. Cochrane handbook for systematic reviews of interventions version 62 (updated February 2021): Cochrane; 2021.
Matza LS, Patrick DL, Riley AW, Alexander JJ, Rajmil L, Pleil AM, et al. Pediatric patient-reported outcome instruments for research to support medical product labeling: report of the ISPOR PRO good research practices for the assessment of children and adolescents task force. Value Health. 2013;16(4):461–79.
Fayed N, De Camargo OK, Kerr E, Rosenbaum P, Dubey A, Bostan C, et al. Generic patient-reported outcomes in child health research: a review of conceptual content using World Health Organization definitions. Dev Med Child Neurol. 2012;54(12):1085–95.
Janssens A, Coon JT, Rogers M, Allen K, Green C, Jenkinson C, et al. A systematic review of generic multidimensional patient-reported outcome measures for children, part I: descriptive characteristics. Value Health. 2015;18(2):315–33.
Petrou S. Methodological issues raised by preference-based approaches to measuring the health status of children. Health Econ. 2003;12(8):697–702. https://doi.org/10.1002/hec.775.
Thompson HL, Reville M-C, Price A, Reynolds L, Rodgers L, Ford T. The quality of life scale for children (QOL-C). J Child Serv. 2014;9(1):4–17. https://doi.org/10.1108/JCS-05-2013-0019.
Starfield B, Ensminger M, Riley A, McGauhey P, Skinner A, Kim S, et al. Adolescent health status measurement: development of the Child Health and Illness Profile. Pediatrics. 1993;91(2):430–435. https://doi.org/10.1542/peds.91.2.430.
Drummond MF, Sculpher MJ, Claxton K, Stoddart GL, Torrance GW. Methods for the economic evaluation of health care programmes. Oxford: Oxford University Press; 2015.
Karimi M, Brazier J. Health, health-related quality of life, and quality of life: what is the difference? Pharmacoeconomics. 2016;34(7):645–9.
Chen G, Ratcliffe J. A review of the development and application of generic multi-attribute utility instruments for paediatric populations. Pharmacoeconomics. 2015;33(10):1013–28. https://doi.org/10.1007/s40273-015-0286-7.
Torrance GW. Measurement of health state utilities for economic appraisal: a review. J Health Econ. 1986;5(1):1–30.
Bansback N, Brazier J, Tsuchiya A, Anis A. Using a discrete choice experiment to estimate health state utility values. J Health Econ. 2012;31(1):306–18.
National Institute for Health and Care Excellence. Guide to the methods of technology appraisal 2013. In: Excellence NIfHaC, editor. PMG92013.
Pharmaceutical Benefits Advisory Committee. Guidelines for preparing submissions to the Pharmaceutical Benefits Advisory Committee (Version 5). In: Pharmaceutical Benefits Advisory Committee DoH, editor. 2016.
Canadian Agency for Drugs and Technologies in Health. Guidelines for the economic evaluation of health technologies: Canada 4th Edition. In: Guidelines CMa, editor. 2017.
Kwon J, Kim SW, Ungar WJ, Tsiplova K, Madan J, Petrou S. Patterns, trends and methodological associations in the measurement and valuation of childhood health utilities. Qual Life Res. 2019;28(7):1705–24.
Ratcliffe J, Flynn T, Terlich F, Stevens K, Brazier J, Sawyer M. Developing adolescent-specific health state values for economic evaluation: an application of profile case best-worst scaling to the Child Health Utility 9D. Pharmacoeconomics. 2012;30(8):713–27. https://doi.org/10.2165/11597900-000000000-00000.
Apajasalo M, Sintonen H, Holmberg C, Sinkkonen J, Aalberg V, Pihko H, et al. Quality of life in early adolescence: a sixteen-dimensional health-related measure (16D). Qual Life Res. 1996;5(2):205–11.
Ratcliffe J, Stevens K, Flynn T, Brazier J, Sawyer MG. Whose values in health? An empirical comparison of the application of adolescent and adult values for the CHU-9D and AQOL-6D in the Australian adolescent general population. Value Health. 2012;15(5):730–6.
Rowen D, Rivero-Arias O, Devlin N, Ratcliffe J. Review of valuation methods of preference-based measures of health for economic evaluation in child and adolescent populations: where are we now and where are we going? Pharmacoeconomics. 2020;38(4):325–40.
Cremeens J, Eiser C, Blades M. Characteristics of health-related self-report measures for children aged three to eight years: a review of the literature. Qual Life Res. 2006;15(4):739–54.
Davis E, Waters E, Mackinnon A, Reddihough D, Graham HK, Mehmet-Radji O, et al. Paediatric quality of life instruments: a review of the impact of the conceptual framework on outcomes. Dev Med Child Neurol. 2006;48(4):311–8.
Eiser C, Morse R. A review of measures of quality of life for children with chronic illness. Arch Dis Child. 2001;84(3):205–11.
Grange A, Bekker H, Noyes J, Langley P. Adequacy of health-related quality of life measures in children under 5 years old: systematic review. J Adv Nurs. 2007;59(3):197–220.
Harding L. Children’s quality of life assessments: a review of generic and health related quality of life measures completed by children and adolescents. Clin Psychol Psychotherapy. 2001;8(2):79–96.
Solans M, Pane S, Estrada MD, Serra-Sutton V, Berra S, Herdman M, et al. Health-related quality of life measurement in children and adolescents: a systematic review of generic and disease-specific instruments. Value Health. 2008;11(4):742–64.
World Health Organization. International classification of functioning, disability, and health: children and youth version: ICF-CY. World Health Organization; 2007.
Moher D, Liberati A, Tetzlaff J, Altman DG, Group P. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. Ann Intern Med. 2009;151(4):264–9 (W64).
Veritas Health Innovation. Covidence systematic review software. Melbourne.
Etienne A-M, Dupuis G, Spitz E, Lemetayer F, Missotten P. The gap concept as a quality of life measure: validation study of the Child Quality of Life Systemic Inventory. Soc Indic Res. 2011;100(2):241–57.
Vogels T, Verrips G, Verloove-Vanhorick S, Fekkes M, Kamphuis R, Koopman H, et al. Measuring health-related quality of life in children: the development of the TACQOL parent form. Qual Life Res. 1998;7(5):457–65.
Varni JW, Seid M, Kurtin PS. PedsQL™ 4.0: reliability and validity of the Pediatric Quality of Life Inventory™ Version 4.0 Generic Core Scales in healthy and patient populations. Med Care. 2001;39:800–12.
World Bank. Data: World Bank Country and Lending Groups 2021 [31.5.2021]. Available from: https://datahelpdesk.worldbank.org/knowledgebase/articles/906519-world-bank-country-and-lending-groups.
Jiang S, Ngai SS-Y. Assessing multiple domains of child well-being: Preliminary development and validation of the multidimensional child well-being scale (MCWBS). Curr Psychol. 2020:1–12.
Wang X, Matsuda N, Ma H, Shinfuku N. Comparative study of quality of life between the Chinese and Japanese adolescent populations. Psychiatry Clin Neurosci. 2000;54(2):147–52.
Izutsu T, Tsutsumi A, Islam MA, Matsuo Y, Yamada HS, Kurita H, et al. Validity and reliability of the Bangla version of WHOQOL-BREF on an adolescent population in Bangladesh. Qual Life Res. 2005;14(7):1783–9.
Moodie M, Richardson J, Rankin B, Iezzi A, Sinha K. Predicting time trade-off health state valuations of adolescents in four Pacific countries using the Assessment of Quality-of-Life (AQoL-6D) instrument. Value Health. 2010;13(8):1014–27. https://doi.org/10.1111/j.1524-4733.2010.00780.x.
Wille N, Badia X, Bonsel G, Burström K, Cavrini G, Devlin N, et al. Development of the EQ-5D-Y: a child-friendly version of the EQ-5D. Qual Life Res. 2010;19(6):875–86.
Verstraete J, Ramma L, Jelsma J. Item generation for a proxy health related quality of life measure in very young children. Health Qual Life Outcomes. 2020;18(1):1–15.
Verstraete J, Lloyd A, Scott D, Jelsma J. How does the EQ-5D-Y proxy version 1 perform in 3, 4 and 5-year-old children? Health Qual Life Outcomes. 2020;18:1–10.
Eiser C, Vance Y, Seamark D. The development of a theoretically driven generic measure of quality of life for children aged 6–12 years: a preliminary report. Child Care Health Dev. 2000;26(6):445–56.
Bevans KB, Riley AW, Forrest CB. Development of the healthy pathways child-report scales. Qual Life Res. 2010;19(8):1195–214.
Ravens-Sieberer U, Heilmann M, Walleser S. Assessment of quality of life in young children with a Computer Assisted Touch Screen Program (CAT-Screen)-reliability, validity and feasibility. Qual Life Res. 2000:298.
Varni JW, Thissen D, Stucky BD, Liu Y, Gorder H, Irwin DE, et al. PROMIS® Parent Proxy Report Scales: an item response theory analysis of the parent proxy report item banks. Qual Life Res. 2012;21(7):1223–40.
DeWalt DA, Gross HE, Gipson DS, Selewski DT, DeWitt EM, Dampier CD, et al. PROMIS® pediatric self-report scales distinguish subgroups of children within and across six common pediatric chronic health conditions. Qual Life Res. 2015;24(9):2195–208.
Devine J, Otto C, Rose M, Barthel D, Fischer F, Mülhan H, et al. A new computerized adaptive test advancing the measurement of health-related quality of life (HRQoL) in children: the Kids-CAT. Qual Life Res. 2015;24(4):871–84.
Varni JW, Magnus B, Stucky BD, Liu Y, Quinn H, Thissen D, et al. Psychometric properties of the PROMIS® pediatric scales: precision, stability, and comparison of different scoring and administration options. Qual Life Res. 2014;23(4):1233–43.
Young NL, Wabano MJ, Burke TA, Ritchie SD, Mishibinijima D, Corbiere RG. A process for creating the Aboriginal Children’s Health and Well-being Measure (ACHWM). Can J Public Health. 2013;104(2):e136–41.
Touchèque M, Etienne A-M, Missotten P, Dupuis G. A comparison of a tablet version of the Quality of Life Systemic Inventory for Children (QLSI-C) to the standard paper version. Psychol Assess. 2016;28(6):780.
Kang E. Validity of Child Health-6 Dimension (Ch-6d) for adolescents. Value Health. 2016;19(7):A854. https://doi.org/10.1016/j.jval.2016.08.458.
Jabrayilov R, van Asselt AD, Vermeulen KM, Volger S, Detzel P, Dainelli L, et al. A descriptive system for the Infant health-related Quality of life Instrument (IQI): measuring health with a mobile app. PLoS ONE. 2018;13(8):e0203276.
Landgraf JM, van Grieken A, Raat H. Giving voice to the child perspective: psychometrics and relative precision findings for the Child Health Questionnaire self-report short form (CHQ-CF45). Qual Life Res. 2018;27(8):2165–76.
Forrest CB, Bevans KB, Pratiwadi R, Moon J, Teneralli RE, Minton JM, et al. Development of the PROMIS® pediatric global health (PGH-7) measure. Qual Life Res. 2014;23(4):1221–31.
Gullone E, Cummins RA. The Comprehensive Quality of Life Scale: a psychometric evaluation with an adolescent sample. Behav Change. 1999;16(2):127–39.
Seligson JL, Huebner ES, Valois RF. Preliminary validation of the brief multidimensional students’ life satisfaction scale (BMSLSS). Soc Indic Res. 2003;61(2):121–45.
Ravens-Sieberer U, Erhart M, Power M, Auquier P, Cloetta B, Hagquist C, et al. # 1793-C/item-response-theory analyses of child and adolescent self-report quality of life data: the European Cross Cultural Research Instrument Kidscreen. Qual Life Res. 2003:722.
Ravens-Sieberer U, Gosch A, Rajmil L, Erhart M, Bruil J, Power M, et al. The KIDSCREEN-52 quality of life measure for children and adolescents: psychometric results from a cross-cultural survey in 13 European countries. Value Health. 2008;11(4):645–58.
Ravens-Sieberer U, Auquier P, Erhart M, Gosch A, Rajmil L, Bruil J, et al. The KIDSCREEN-27 quality of life measure for children and adolescents: psychometric results from a cross-cultural survey in 13 European countries. Qual Life Res. 2007;16(8):1347–56.
Ravens-Sieberer U, Erhart M, Rajmil L, Herdman M, Auquier P, Bruil J, et al. Reliability, construct and criterion validity of the KIDSCREEN-10 score: a short measure for children and adolescents’ well-being and health-related quality of life. Qual Life Res. 2010;19(10):1487–500.
Saigal S, Rosenbaum P, Stoskopf B, Hoult L, Furlong W, Feeny D, et al. Development, reliability and validity of a new measure of overall health for pre-school children. Qual Life Res. 2005;14(1):243–52.
Manificat S, Dazord A. Assessing adolescent’s quality of life: validation of a new questionnaire. Qual Life Newsl. 2002;28:2–3.
Singh G, Athreya BH, Fries JF, Goldsmith DP. Measurement of health status in children with juvenile rheumatoid arthritis. Arthritis Rheum. 1994;37(12):1761–9.
Endicott J, Nee J, Yang R, Wohlberg C. Pediatric quality of life enjoyment and satisfaction questionnaire (PQ-LES-Q): reliability and validity. J Am Acad Child Adolesc Psychiatry. 2006;45(4):401–7.
Cummins RA, Lau AL. Personal Wellbeing Index—school children. Melbourne: School of Psychology, Deakin University; 2005.
Fuh J, Wang S, Lu S, Juang K. Assessing quality of life for adolescents in Taiwan. Psychiatry Clin Neurosci. 2005;59(1):11–8.
Chen G, Xu F, Huynh E, Wang Z, Stevens K, Ratcliffe J. Scoring the Child Health Utility 9D instrument: estimation of a Chinese child and adolescent-specific tariff. Qual Life Res. 2019;28(1):163–76.
Apajasalo M, Rautonen J, Holmberg C, Sinkkonen J, Aalberg V, Pihko H, et al. Quality of life in pre-adolescence: a 17-dimensional health-related measure (17D). Qual Life Res. 1996;5(6):532–8.
Craig BM, Greiner W, Brown DS, Reeve BB. Valuation of child health-related quality of life in the United States. Health Econ. 2016;25(6):768–77.
Ratcliffe J, Chen G, Stevens K, Bradley S, Couzner L, Brazier J, et al. Valuing Child Health Utility 9D health states with young adults: insights from a time trade off study. Appl Health Econ Health Policy. 2015;13(5):485–92.
Rupel VP, Ogorevc M. EQ-5D-Y value set for Slovenia. Pharmacoeconomics. 2021;39(4):463–71.
Shiroiwa T, Ikeda S, Noto S, Fukuda T, Stolk E. Valuation survey of EQ-5D-Y Based on the international common protocol: development of a value set in Japan. Med Decis Mak. 2021;41(5):597–606.
Rowen D, Mulhern B, Stevens K, Vermaire JH. Estimating a Dutch value set for the pediatric preference-based CHU9D using a discrete choice experiment with duration. Value Health. 2018;21(10):1234–42.
Brazier J, Ratcliffe J, Saloman J, Tsuchiya A. Measuring and valuing health benefits for economic evaluation. Oxford: Oxford University Press; 2017.
Ratcliffe J, Huynh E, Chen G, Stevens K, Swait J, Brazier J, et al. Valuing the Child Health Utility 9D: using profile case best worst scaling methods to develop a new adolescent specific scoring algorithm. Soc Sci Med. 2016;157:48–59. https://doi.org/10.1016/j.socscimed.2016.03.042.
Torrance GW, Feeny DH, Furlong WJ, Barr RD, Zhang Y, Wang Q. Multiattribute utility function for a comprehensive health status classification system: Health Utilities Index Mark 2. Med Care. 1996;34:702–22.
McCabe C, Stevens K, Roberts J, Brazier J. Health state values for the HUI 2 descriptive system: results from a UK survey. Health Econ. 2005;14(3):231–44.
Krabbe PF, Jabrayilov R, Detzel P, Dainelli L, Vermeulen KM, van Asselt AD. A two-step procedure to generate utilities for the Infant health-related Quality of life Instrument (IQI). PLoS ONE. 2020;15(4):e0230852.
Beusterien KM, Yeung J-E, Pang F, Brazier J. Development of the multi-attribute adolescent health utility measure (AHUM). Health Qual Life Outcomes. 2012;10(1):1–9.
Stevens K. Valuation of the child health utility 9D index. Pharmacoeconomics. 2012;30(8):729–47.
Feeny D, Furlong W, Torrance GW, Goldsmith CH, Zhu Z, DePauw S, et al. Multiattribute and single-attribute utility functions for the health utilities index mark 3 system. Med Care. 2002;40(2):113–28.
Noto S, Uemura T. Japanese health utilities index mark 3 (HUI3): measurement properties in a community sample. J Patient-Rep Outcomes. 2020;4(1):9.
Le Galès C, Buron C, Costet N, Rosman S, Slama PG. Development of a preference-weighted health status classification system in France: the Health Utilities Index 3. Health Care Manag Sci. 2002;5(1):41–51.
Seiber WJ, Groessl EJ, David KM, Ganiats TG, Kaplan RM. Quality of well being self-administered (QWB-SA) scale. San Diego: Health Services Research Center, University of California; 2008.
Stevens K. Developing a descriptive system for a new preference-based measure of health-related quality of life for children. Qual Life Res. 2009;18(8):1105–13.
Janssens A, Rogers M, Coon JT, Allen K, Green C, Jenkinson C, et al. A systematic review of generic multidimensional patient-reported outcome measures for children, part II: evaluation of psychometric performance of English-language versions in a general population. Value Health. 2015;18(2):334–45.
Brennan A, Akehurst R. Modelling in health economic evaluation. Pharmacoeconomics. 2000;17(5):445–59.
Blackwell CK, Wakschlag L, Krogh-Jespersen S, Buss KA, Luby J, Bevans K, et al. Pragmatic health assessment in early childhood: the PROMIS® of developmentally based measurement for pediatric psychology. J Pediatr Psychol. 2020;45(3):311–8.
Manificat S, Dazord A. Infant, child and adolescent quality of life: surveys performed in a European context. Expert Rev Pharmacoecon Outcomes Res. 2002;2(6):589–96.
Ravens-Sieberer U, Bullinger M. Assessing health-related quality of life in chronically ill children with the German KINDL: first psychometric and content analytical results. Qual Life Res. 1998;7(5):399–407.
Eiser C, Morse R. Can parents rate their child’s health-related quality of life? Results of a systematic review. Qual Life Res. 2001;10(4):347–57.
Khadka J, Kwon J, Petrou S, Lancsar E, Ratcliffe J. Mind the (inter-rater) gap an investigation of self-reported versus proxy-reported assessments in the derivation of childhood utility values for economic evaluation: a systematic review. Soc Sci Med. 2019;240:112543.
Bevans KB, Riley AW, Forrest CB. Development of the healthy pathways parent-report scales. Qual Life Res. 2012;21(10):1755–70.
Coons SJ, Gwaltney CJ, Hays RD, Lundy JJ, Sloan JA, Revicki DA, et al. Recommendations on evidence needed to support measurement equivalence between electronic and paper-based patient-reported outcome (PRO) measures: ISPOR ePRO Good Research Practices Task Force report. Value Health. 2009;12(4):419–29.
Coombes L, Bristowe K, Ellis-Smith C, Aworinde J, Fraser LK, Downing J, et al. Enhancing validity, reliability and participation in self-reported health outcome measurement for children and young people: a systematic review of recall period, response scale format, and administration modality. Qual Life Res. 2021:30;1803–1832. https://doi.org/10.1007/s11136-021-02814-4.
Wasson JW, Kairys SW, Nelson EC, Kalishman N, Baribeau P. A short survey for assessing health and social problems of adolescents. J Fam Pract. 1994;38(5):489–95.
Ratcliffe J, Couzner L, Flynn T, Sawyer M, Stevens K, Brazier J, et al. Valuing child health utility 9D health states with a young adolescent sample. Appl Health Econ Health Policy. 2011;9(1):15–27.
Rowen D, Brazier J, Van Hout B. A comparison of methods for converting DCE values onto the full health-dead QALY scale. Med Decis Mak. 2015;35(3):328–40.
Viney R, Norman R, Brazier J, Cronin P, King MT, Ratcliffe J, et al. An Australian discrete choice experiment to value EQ-5D health states. Health Econ. 2014;23(6):729–42.
Ramos-Goñi JM, Oppe M, Stolk E, Shah K, Kreimeier S, Rivero-Arias O, et al. International valuation protocol for the EQ-5D-Y-3L. Pharmacoeconomics. 2020;38(7):653–63.
Rogers HJ, Marshman Z, Rodd H, Rowen D. Discrete choice experiments or best-worst scaling? A qualitative study to determine the suitability of preference elicitation tasks in research with children and young people. J Patient-Rep Outcomes. 2021;5(1):1–11.
Ungar WJ, Zur RM. Health economic evaluation for improving child health in low-and middle-income countries. In: Optimizing treatment for children in the developing world. Berlin: Springer; 2015. p. 213–24.
Rowen D, Keetharuth AD, Poku E, Wong R, Pennington B, Wailoo A. A review of the psychometric performance of selected child and adolescent preference-based measures used to produce utilities for child and adolescent health. Value Health. 2020;24:443–60.
Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN checklist for assessing the methodological quality of studies on measurement properties of health status measurement instruments: an international Delphi study. Qual Life Res. 2010;19(4):539–49.
Oluboyede Y, Robinson T. Measuring weight-specific quality of life in adolescents: an examination of the concurrent validity and test-retest reliability of the WAItE. Value Health. 2019;22(3):348–54.
Xie F, Pickard AS, Krabbe PF, Revicki D, Viney R, Devlin N, et al. A checklist for reporting valuation studies of multi-attribute utility-based instruments (CREATE). Pharmacoeconomics. 2015;33(8):867–77.
Manificat S, Dazord A, Langue J, Danjou G, Bauche P, Bovet F, et al. A new instrument to evaluate infant quality of life. Qual Life Newsl. 1999;7(8–7):8.
Manificat S, Dazord A. Children’s quality of life assessment: preliminary results obtained with the AUQUEI questionnaire. Qual Life Newsl. 1998;19:2–3.
Lam C, Young N, Marwaha J, McLimont M, Feldman BM. Revised versions of the Childhood Health Assessment Questionnaire (CHAQ) are more sensitive and suffer less from a ceiling effect. Arthritis Care Res. 2004;51(6):881–9.
Starfield B, Riley AW, Green BF, Ensminger ME, Ryan SA, Kelleher K, et al. The adolescent child health and illness profile: a population-based measure of health. Med Care. 1995;33:553–66.
Riley AW, Forrest CB, Rebok GW, Starfield B, Green BF, Robertson JA, et al. The child report form of the CHIP-child edition: reliability and validity. Med Care. 2004;42:221–31.
Riley AW, Forrest CB, Starfield B, Rebok GW, Robertson JA, Green BF. The parent report form of the CHIP-Child edition: reliability and validity. Med Care. 2004;42:210–20.
Landgraf J, Maunsell E, Speechley KN, Bullinger M, Campbell S, Abetz L, et al. Canadian-French, German and UK versions of the Child Health Questionnaire: methodology and preliminary item scaling results. Qual Life Res. 1998;7(5):433–45.
Raat H, Botterweck AM, Landgraf JM, Hoogeveen WC, Essink-Bot M-L. Reliability and validity of the short form of the child health questionnaire for parents (CHQ-PF28) in large random school based and general population samples. J Epidemiol Community Health. 2005;59(1):75–82.
Landgraf JM, Abetz LN. Functional status and well-being of children representing three cultural groups: initial self-reports using the CHQ-CF87. Psychol Health. 1997;12(6):839–54.
Maylath NS. Development of the Children’s Health Ratings Scale. Health Educ Q. 1990;17(1):89–97.
Hester NO. Child's health self-concept scale: its development and psychometric properties. Adv Nurs Sci. 1984;7(1):45–55.
Graham P, Stevenson J, Flynn D. A new measure of health-related quality of life for children: preliminary findings. Psychol Health. 1997;12(5):655–65.
Vo TXH, Guillemin F, Deschamps J-P. Psychometric properties of the DUKE Health Profile-adolescent version (DHP-A): a generic instrument for adolescents. Qual Life Res. 2005;14(10):2229–34.
Stein RE, Jessop DJ. Functional status II (R): a measure of child health status. Med Care. 1990;28:1041–55.
Collier J, MacKinley D. Developing a generic child quality of life measure. Health Psychol Update. 1997:12–6.
le Coq EM, Colland VT, Boeke AJP, Bezemer DP, van Eijk JTM. Reproducibility, construct validity, and responsiveness of the “How Are You?”(HAY), a self-report quality of life questionnaire for children with asthma. J Asthma. 2000;37(1):43–58.
Holmström MR, Olofsson N, Kristiansen L, Asplund K. Health among 6-year-old children in a Swedish county: based on the Health dialogue. Vulnerable Groups Incl. 2012;3(1):8416.
Holmström MR, Olofsson N, Asplund K, Kristiansen L. Exploring the development of school children’s health. Br J School Nurs. 2012;7(4):189–97.
Olofsson N, Rising Holmström M, Kristiansen L. Assessing the construct validity and reliability of school health records of the ‘Health Dialogue Questionnaire,’ in 7th Grade in Compulsory School. MOJ Public Health. 2015;2(1).
Kristiansen L, Holmstrom MR, Olofsson N. Assessing the construct validity and reliability of school health records using the ‘Health Dialogue Questionnaire’ in the eleventh grade. AIMS Public Health. 2016;3(3):470.
Jozefiak T, Wallander JL. Perceived family functioning, adolescent psychopathology and quality of life in the general population: a 6-month follow-up study. Qual Life Res. 2016;25(4):959–67.
Klassen AF, Landgraf JM, Lee SK, Barer M, Raina P, Chan HW, et al. Health related quality of life in 3 and 4 year old children and their parents: preliminary findings about a new questionnaire. Health Qual Life Outcomes. 2003;1(1):1–11.
Landgraf JM, Vogel I, Oostenbrink R, van Baar ME, Raat H. Parent-reported health outcomes in infants/toddlers: measurement properties and clinical validity of the ITQOL-SF47. Qual Life Res. 2013;22(3):635–46.
Barthel D, Fischer K, Nolte S, Otto C, Meyrose A-K, Reisinger S, et al. Implementation of the Kids-CAT in clinical settings: a newly developed computer-adaptive test to facilitate the assessment of patient-reported outcomes of children and adolescents in clinical practice in Germany. Qual Life Res. 2016;25(3):585–94.
Lindström B, Köhler L. Youth, disability and quality of life. Pediatrician. 1991.
Chan KS, Mangione-Smith R, Burwinkle TM, Rosen M, Varni JW. The PedsQL™: reliability and validity of the short-form generic core scales and asthma module. Med Care. 2005;43:256–65.
Varni JW, Limbers CA, Neighbors K, Schulz K, Lieu JE, Heffer RW, et al. The PedsQL™ Infant Scales: feasibility, internal consistency reliability, and validity in healthy and ill infants. Qual Life Res. 2011;20(1):45–55.
Ishaque S, Roberts R, Karnon J, Thomas D, Salter A, editors. Adaptation/content validation of measure yourself medical outcomes profile (MYMOP) questionnaire for 7–11 year old children. Quality of life research. Dordrectht: Springer; 2018.
Vaqué-Crusellas C, González M, Casas F. Does satisfaction with food matter? Testing the personal well-being index-school children (PWI-SC) with an additional item on satisfaction with food on a sample of 10 to 12-year-olds. Child Indic Res. 2015;8(4):961–73.
Bouman NH, Koot HM, Van Gils AP, Verhulst FC. Development of a health-related quality of life instrument for children: the quality of life questionnaire for children. Psychol Health. 1999;14(5):829–46.
Raphael D, Rukholm E, Brown I, Hill-Bailey P, Donato E. The quality of life profile—adolescent version: background, description, and initial validation. J Adolesc Health. 1996;19(5):366–75.
Huebner ES. Initial development of the student’s life satisfaction scale. Sch Psychol Int. 1991;12(3):231–40.
Huebner ES. Preliminary development and validation of a multidimensional life satisfaction scale for children. Psychol Assess. 1994;6(2):149.
Gilligan TD, Huebner S. Initial development and validation of the multidimensional students’ life satisfaction scale–adolescent version. Appl Res Qual Life. 2007;2(1):1–16.
Theunissen NC, Vogels TG, Koopman HM, Verrips GH, Zwinderman KA, Verloove-Vanhorick SP, et al. The proxy problem: child report versus parent report in health-related quality of life research. Qual Life Res. 1998;7(5):387–97.
Fekkes M, Theunissen N, Brugman E, Veen S, Verrips E, Koopman H, et al. Development and psychometric evaluation of the TAPQOL: a health-related quality of life instrument for 1–5-year-old children. Qual Life Res. 2000;9(8):961–72.
Lawford J, Volavka N, Eiser C. A generic measure of quality of life for children aged 3–8 years: results of two preliminary studies. Pediatr Rehabil. 2001;4(4):197–207.
Simeoni M, Auquier P, Antoniotti S, Sapin C, San MJ. Validation of a French health-related quality of life instrument for adolescents: the VSP-A. Qual Life Res. 2000;9(4):393–403.
Spencer PN, Coe C. The development and validation of a measure off parent-reported child health and morbidity: the Warwick Child Health and Morbidity Profile. Child Care Health Dev. 1996;22(6):367–79.
Edwards TC, Huebner CE, Connell FA, Patrick DL. Adolescent quality of life, part I: conceptual and measurement model. J Adolesc. 2002;25(3):275–86.
Patrick DL, Edwards TC, Topolski TD. Adolescent quality of life, part II: initial validation of a new instrument. J Adolesc. 2002;25(3):287–300.
Topolski TD, Patrick DL, Edwards TC, Huebner CE, Connell FA, Mount KK. Quality of life and health-risk behaviors among adolescents. J Adolesc Health. 2001;29(6):426–35.
Furlong WJ, Feeny DH, Torrance GW, Barr RD. The Health Utilities Index (HUI®) system for assessing health-related quality of life in clinical studies. Ann Med. 2001;33(5):375–84.
Stevens K. Assessing the performance of a new generic measure of health-related quality of life for children and refining it for use in health state valuation. Appl Health Econ Health Policy. 2011;9(3):157–69.
Gusi N, Perez-Sousa M, Gozalo-Delgado M, Olivares P. Validity and reliability of the Spanish EQ-5D-Y proxy version. Anales de Pediatría (English Edition). 2014;81(4):212–9.
Kreimeier S, Åström M, Burström K, Egmar A-C, Gusi N, Herdman M, et al. EQ-5D-Y-5L: developing a revised EQ-5D-Y with increased response categories. Qual Life Res. 2019;28(7):1951–61.
Kaplan RM, Bush JW, Berry CC. Health status: types of validity and the index of well-being. Health Serv Res. 1976;11(4):478–507.
Kaplan RM, Sieber WJ, Ganiats TG. The quality of well-being scale: comparison of the interviewer-administered version with a self-administered questionnaire. Psychol Health. 1997;12(6):783–91.
Verstraete J, Ramma L, Jelsma J. Validity and reliability testing of the Toddler and Infant (TANDI) health related quality of life instrument for very young children. J Patient-Rep Outcomes. 2020;4(1):1–14.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Funding
This research has been funded by the Australian Government’s Medical Research Future Fund under Grants MRF1200816 and MRF1199902. SP receives support as a National Institute for Health Research (NIHR) Senior Investigator (NF-SI-0616-10103) and from the NIHR Applied Research Collaboration Oxford and Thames Valley. CH is funded by an NIHR Doctoral Research Fellowship (NIHR300684). The views expressed are those of the authors and not necessarily those of the Australian Government, the NIHR or the Department of Health and Social Care in England.
Conflict of interest
ND and BM are members of the EuroQol Group, which developed the EQ-5D-Y, one of the measures included in review. Other authors have no conflict of interest to declare.
Ethical approval
Not applicable.
Availability of data and material
The Excel files of extracted data are available from the corresponding authors on reasonable request.
Code availability
The code used in this study is available from the authors upon reasonable request.
Consent for publication
The authors, jointly and severally, give the publisher the permission to publish the work.
Consent to participate
Not applicable.
Author contributions
Conceptualisation and methodology: all co-authors. Database search, study selection and data extraction: SP, JK, EH, MH, KK, SD, NR, CH and SS for identification of patient-reported outcome measures; LF, GC and ND for preference-based value sets. Data synthesis: SP, JK, EH, MH, KK, SD, CH and SS for identification of patient-reported outcome measures; LF, GC, ND, CB, KD, AH, BM and JR for preference-based value sets. Writing original draft: SP, JK, EH, MH, SD, GC, ND, CB, BM and JR. Draft review and editing: all co-authors.
Additional information
The original Online version of this article was revised: The co-author's name has been incorrectly published.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License, which permits any non-commercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc/4.0/.
About this article
Cite this article
Kwon, J., Freijser, L., Huynh, E. et al. Systematic Review of Conceptual, Age, Measurement and Valuation Considerations for Generic Multidimensional Childhood Patient-Reported Outcome Measures. PharmacoEconomics 40, 379–431 (2022). https://doi.org/10.1007/s40273-021-01128-0
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40273-021-01128-0