Psychometric properties and general population reference values for PROMIS Global Health in Hungary

Bató, Alex; Brodszky, Valentin; Mitev, Ariel Zoltán; Jenei, Balázs; Rencz, Fanni

doi:10.1007/s10198-023-01610-w

Psychometric properties and general population reference values for PROMIS Global Health in Hungary

Original Paper
Open access
Published: 28 June 2023

Volume 25, pages 549–562, (2024)
Cite this article

Download PDF

You have full access to this open access article

The European Journal of Health Economics Aims and scope Submit manuscript

Psychometric properties and general population reference values for PROMIS Global Health in Hungary

Download PDF

1801 Accesses
2 Citations
1 Altmetric
Explore all metrics

Abstract

Objectives

Patient-Reported Outcomes Measurement Information System–Global Health (PROMIS-GH) is a widely used generic measure of health status. This study aimed to (1) assess the psychometric properties of the Hungarian PROMIS-GH and to (2) develop general population reference values in Hungary.

Methods

An online cross-sectional survey was conducted among the Hungarian adult general population (n = 1700). Respondents completed the PROMIS-GH v1.2. Unidimensionality (confirmatory factor analysis and bifactor model), local independence, monotonicity (Mokken scaling), graded response model fit, item characteristic curves and measurement invariance were examined. Spearman’s correlations were used to analyse convergent validity of PROMIS-GH subscales with SF-36v1 composites and subscales. Age- and gender-weighted T-scores were computed for the Global Physical Health (GPH) and Global Mental Health (GMH) subscales using the US item calibrations.

Results

The item response theory assumptions of unidimensionality, local independence and monotonicity were met for both subscales. The graded response model showed acceptable fit indices for both subscales. No differential item functioning was detected for any sociodemographic characteristics. GMH T-scores showed a strong correlation with SF-36 mental health composite score (r_s = 0.71) and GPH T-scores with SF-36 physical health composite score (r_s = 0.83). Mean GPH and GMH T-scores of females were lower (47.8 and 46.4) compared to males (50.5 and 49.3) (p < 0.001), and both mean GPH and GMH T-scores decreased with age, suggesting worse health status (p < 0.05).

Conclusion

This study established the validity and developed general population reference values for the PROMIS-GH in Hungary. Population reference values facilitate the interpretation of patients’ scores and allow inter-country comparisons.

Psychometric properties of the patient-reported outcomes measurement information system scale v1.2: global health (PROMIS-GH) in a Dutch general population

Article Open access 27 September 2021

Dutch reference values for the Patient-Reported Outcomes Measurement Information System Scale v1.2 - Global Health (PROMIS-GH)

Article Open access 12 May 2021

Psychometric evaluation of the Mental Health Quality of Life (MHQoL) instrument in seven European countries

Article Open access 01 September 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Health status measures are widely used in clinical practice, observational studies, clinical trials, monitoring general population health, assessing the performance of health systems and in cost-effectiveness analysis [1]. Two forms of health status measures can be distinguished as follows: condition-specific and generic [2]. Condition-specific measures have a specific target population and are able to capture a wide range of symptoms and health problems relevant to a certain condition (e.g. itching in skin diseases or bowel problems in gastrointestinal diseases). In contrast, generic health status measures incorporate health areas that are relevant across different patient populations as well as for the general public (e.g. physical functioning, pain, sleeping). These measures have the advantage of allowing comparisons across different conditions, health interventions and with general population reference values.

In general, a large number of items are needed to precisely assess one’s health status; however, this may lack practical considerations (e.g. time and respondent burden). Therefore, short-form health assessments have gained popularity. Commonly used short generic health status measures include the EQ-5D and SF-36 [3, 4]. These instruments, however, were developed decades ago and one of their common criticisms is that their item development and selection did not benefit from modern psychometric methods, such as item response theory (IRT). The Patient Reported Outcomes Measurement Information System (PROMIS) initiative, funded by the National Institutes of Health in the US, aimed to develop, validate and standardize item banks to measure health outcomes across a broad range of health areas [5]. In the past two decades, over 100 PROMIS item banks and a few fixed-length short-forms have been developed using IRT methods (e.g. PROMIS Global Health, PROMIS-29, PROMIS-43, PROMIS-57) [6, 7]. The main advantages of IRT over classical test theory methods include the estimation of the respondents’ location on an underlying ‘latent’ trait (e.g. health status) based on any subset of items that do not vary depending on the characteristics of the population and the possibility to adaptively assess health status using computerised adaptive testing [8, 9].

PROMIS Global Health (PROMIS-GH) is the shortest PROMIS short-form that measures five generic domains of health (physical functioning, pain, fatigue, emotional distress, social health) using 10 global health items [10]. Its validity, reliability and responsiveness have been confirmed in several populations, including patients with stroke [11], orthopaedic conditions [12,13,14,15,16], amyloidosis [17], inflammatory bowel diseases [18], pregnant women [19] and older adults [20]. The international use of PROMIS-GH has also been expanding outside the US, including studies from the UK [21], Germany [22] and the Netherlands [23]. Furthermore, two countries, the US and the Netherlands have also established general population reference values [24, 25]. So far, PROMIS-GH has not been used in Hungary. Therefore, this study aimed to evaluate the psychometric performance of the Hungarian PROMIS-GH and to develop general population reference values in Hungary.

Methods

Study design and recruitment

The study was approved by the Research Ethics Committee of the Corvinus University of Budapest (no. KRH/343/2020). In November 2020, an online cross-sectional survey was conducted among the Hungarian adult general population. Respondents were recruited by a survey company from members of the largest Hungarian online panel. ‘Soft quotas’ were used for age, gender, education, place of living and geographical region to approximate the distribution of the general population. The inclusion criteria for this study were as follows: (i) ≥ 18 years of age; (ii) place of residence in Hungary; and (iii) giving informed consent prior to data collection.

PROMIS-GH v1.2 was administered as part of a longer survey that aimed to assess the health status and well-being among members of the general public in Hungary [26,27,28]. Respondents were also asked to complete the SF-36v1 and to identify their sociodemographic background (gender, age, education, place of residence, region, employment, household’s net monthly income, marital status, body weight and height) and if they had any chronic health conditions. All respondents first completed SF-36, followed by PROMIS-GH.

Measures

PROMIS Global Health (PROMIS-GH)

The official Hungarian version of PROMIS-GH v1.2 was used as provided by the PROMIS Health Organization. PROMIS-GH consists of 10 items, namely Global01 = general health, Global02 = quality of life, Global03 = physical health, Global04 = mental health, Global05 = satisfaction with discretionary social activities, Global06 = physical function, Global07 = pain, Global08 = fatigue, Global09 = social roles and Global10 = emotional problems [10]. It has two subscales, Global Physical Health (GPH) and Global Mental Health (GMH). GPH consists of Global03, Global06, Global07 and Global08, while GMH includes Global02, Global04, Global05 and Global10. The recall period of the items varies across ‘in general’, the ‘past seven days’ and unspecified. Each item is assessed on a scale with five response levels. For Global01, Global02, Global03, Global04, Global05 and Global09, the best response option is excellent (5), and the worst is poor (1). For Global06 options range from completely (5) to not at all (1), for Global10 from never (5) to always (1) and for Global08 from none (5) to very severe (1). An exception is Global07, which is rated from 0 to 10 (0 = no pain, 10 = worst imaginable pain). We recoded Global07 to a 5-point scale as follows: 0 = 5; 1–3 = 4; 4–6 = 3; 7–9 = 2; 10 = 1 [10]. Raw subscale scores were calculated by adding scores of individual items per subscale. We calculated standardized T-scores from raw scores using the US item calibrations [29]. Mean T-scores therefore represent the mean of the US general population. A higher T-score indicates better health status and a lower T-score refers to worse health status compared to the US general population, where the general population mean is set at 50 with a standard deviation of 10 [24].

36-item short form health survey (SF-36)

The Hungarian version of the SF-36v1 questionnaire was used in our survey with a 4-week recall period. SF-36 is a generic health status measure with 36 items that cover eight health subscales, specifically (1) physical functioning, (2) role limitations due to physical problems (3) bodily pain, (4) general health, (5) vitality, (6) social functioning, (7) role limitations due to emotional problems and (8) mental health [4, 30]. Responses to items are transformed to range from 0 to 100, where higher scores represent better health status. Subscale scores are computed by averaging the respective item scores. SF-36 allows the generation of two summary scores, one for physical health (physical health composite) that includes the first four subscales (1–4) and the other for mental health (mental health composite) including the last four (5–8).

Statistical analyses

In this study, we built on the methods used in earlier psychometric investigations and reference population studies with PROMIS instruments [9, 10, 23, 25]. Data analysis was carried out in R Statistical Software (v4.1.2 Vienna, Austria). We used both classical test theory (e.g. ceiling and floor effect, convergent validity, factor analysis) and IRT methods. Before IRT modelling, we tested the following three assumptions: unidimensionality, local independence and monotonicity [31]. In addition, differential item functioning (DIF) analysis was used to examine measurement invariance. Raw item and subscale scores were used to analyse ceiling and floor effect and for the factor analysis, IRT and DIF analyses. Unweighted T-scores were used to draw histograms and estimate correlations. T-scores were weighted for age group and gender to calculate Hungarian GPH and GMH general population reference values.

Ceiling and floor effect

Ceiling and floor effect were considered if GPH and GMH raw subscale scores exceeded 15% [32].

Unidimensionality

Unidimensionality was tested using confirmatory factor analysis (CFA) and bifactor models. CFA was conducted for the two subscales separately (lavaan package) [33]. Goodness-of-fit was evaluated by the comparative fit index (CFI, cut-off value: > 0.95), Tucker-Lewis index (TLI, cut-off value: > 0.95), the root mean square error of approximation (RMSEA, cut-off value < 0.06), and the standardized root mean squared residual (SRMR, cut-off value: < 0.08) [34, 35]. Further, we used bifactor models to obtain Omega Hierarchical (tentative benchmark > 0.70) and explained common variance (ECV, tentative benchmark > 0.60) [36, 37]. The bifactor models were developed using the psych package [38].

Local independence

To test local independence, we examined the residual correlation matrix resulting from the CFA for both GPH and GMH subscales. Residual correlation values between − 0.20 and 0.20 were considered acceptable supporting local independence [9].

Monotonicity

Monotonicity was analysed using Mokken scale analysis (mokken package). Coefficients (H_i for items, H for subscales) exceeding the cut-off value of > 0.30 were considered acceptable [39, 40].

IRT model fit

Given the polytomous response options of PROMIS-GH items, a graded response model was fitted for both GPH and GMH (mirt package) [41, 42]. To detect item misfit, we used Orlando and Thissen’s S-χ². Items with p-value < 0.001 were considered misfitting [43]. The same cut-off values were used for fit indices (CFI, TLI, RMSEA, SRMR) as for unidimensionality [34]. Item discrimination (slope, a) and item difficulties (threshold, b) were also computed. Item characteristic curves (ICC) were generated for each item of the two subscales.

Measurement invariance

Measurement invariance was assessed by analysing differential item functioning (DIF) using the lordif package [44]. DIF occurs when the responses of a subgroup of respondents on an item consistently differ from those of another subgroup when controlling for the underlying level of the trait measured by the scale [9]. DIF was analysed for GPH and GMH with the following subgroups: gender (female, male), median age (< 47, ≥ 47 years), education (primary, secondary, tertiary), region (Central, Western and Eastern Hungary), employment (employed, not employed), place of residence (capital, other town, village), marital status (married, not married), and income groups (quintiles, do not know, refused to answer groups). First, we used ordinal logistic regression models without an anchor to evaluate DIF. Where DIF was detected, we repeated the analysis using non-DIF items as an anchor. A Pseudo R² change ≥ 0.02 was taken as a critical value [45, 46]. The details of the DIF analysis are provided elsewhere [28].

Convergent validity

Spearman’s rank order correlations were used to explore the convergent validity of the two PROMIS-GH subscales with the eight SF-36 subscales and two composite scores. Correlation coefficients (r_s) were interpreted as very weak (< 0.20), weak (0.20–0.39), moderate (0.40–0.59) and strong correlation (0.60 ≤) [47].

Establishment of general population reference values

Mean GPH and GMH T-scores were weighted according to gender and age group to derive general population reference values using the US item calibrations [48]. Mean weighted T-scores were computed for subgroups of respondents defined by gender, age groups, education, place of residence, region, employment, income groups, marital status, health status question of SF-36 (item 1), BMI and the presence of any chronic condition. We used Taylor linearization for standard errors, and 95% confidence intervals were calculated for each group. The subgroups were compared using Mann–Whitney or Kruskal–Wallis tests, where applicable.

Hypotheses

Regarding the psychometric properties, we hypothesized (1) no ceiling or floor effects for any subscales, (2) unidimensionality, (3) local independence, (4) monotonicity, (5) acceptable fit to the graded response model, (6) no measurement invariance for any subgroups, (7) moderate or strong correlations between the PROMIS-GH subscales (GPH and GMH) and their corresponding SF-36 composite scores [10, 11, 23, 49]. With regard to the reference values, we hypothesized better self-reported health in men and declining physical health with age [50].

Results

Sample characteristics (unweighted)

Overall, 2502 respondents initiated the survey, 2079 of whom consented and 379 quit before the end of the questionnaire. A total of 1700 respondents completed the survey. The mean age was 47.9 ± 16.3 years, and 56.3% of the respondents were female. Nearly one-third of the sample had tertiary education (32.4%). Half of the respondents were employed (50.9%), 23.5% were retired and 4.4% were students. Overall, 22.4% lived in the capital, 48.2% in other towns and 29.4% in villages. The geographical distribution of the sample was as follows: Western Hungary 29.0%, Central Hungary 33.6%, Eastern Hungary 37.4%. Overall, 67.4% of the sample reported to have any chronic disease. The overall sample showed a good representativeness for the general population in Hungary; however, respondents with a secondary education were slightly underrepresented and those who lived in the capital were somewhat overrepresented (Table 1).

Table 1 Characteristics of the study population and PROMIS Global Health reference values in Hungary

Full size table

Ceiling and floor effect

The distributions of GPH and GMH raw scores are presented in Fig. 1. We found almost no floor and low ceiling effect for both GPH (0.4% and 4.1%) and GMH subscales (0.5% and 4.8%) (Table 2). Among the items, Global07 demonstrated the highest floor (29.8%). Global06 showed the highest ceiling (58.2%), followed by Global10 (38.3%), Global08 (23.9%) and Global09 (15.8%).

Table 2 Floor and ceiling of PROMIS Global Health items and subscales

Full size table

Factor and IRT analysis

Unidimensionality

Fit indices confirmed the unidimensionality of both GPH (CFI = 0.993, TLI = 0.978, SRMR = 0.039) and GMH (CFI = 0.999, TLI = 0.997, SRMR = 0.025), with the exception of RMSEA (GPH 0.114 and GMH 0.071). The hypotheses were supported by the bifactor models, resulting in ECV values higher than the tentative benchmark for both subscales (GPH 0.72 and GMH 0.78). Omega Hierarchical was above the tentative benchmark only for GMH (0.73), but not for GPH (0.66) (Table 3).

Table 3 Psychometric properties of PROMIS Global Health subscales

Full size table

Local independence

We found no local dependence between item pairs (Online Resource 1). Eight item pairs had negative residual correlations, but all values were above the value of − 0.20.

Monotonicity

The Mokken scale analysis resulted in coefficients higher than the cut-off value for both subscales (H = 0.531 and 0.638 for GPH and GMH) and items, ranging from H_i = 0.480 (Global08) to 0.717 (Global04) supporting monotonicity (Table 3).

Model fit

Given that unidimensionality, local independence and monotonicity were supported for both subscales, graded response models were fitted. Acceptable fit indices were found for both subscales (GPH: RMSEA = 0.008, SRMR = 0.045, TLI = 0.905, CFI = 0.968 and GMH: RMSEA = 0.012, SRMR = 0.031, TLI = 0.969, CFI = 0.990). A few items showed misfit to the graded response model, namely Global03, Global06, Global02, Global05 and Global10 (p < 0.001) (Table 3). Item difficulties (b) ranged from − 3.7 (Global08) to 1.7 (Global03) for GPH and from − 2.9 (Global10) to 1.7 (Global02) for GMH. Item discrimination (a) values ranged from 1.6 (Global08) to 2.3 (Global07) and from 1.7 (Global10) to 8.0 (Global04) for GPH and GMH, respectively. ICCs for the two subscales are displayed in Fig. 2.

Measurement invariance

After the first step (without anchors), one item (Global07) was flagged for DIF based on age groups, and two items (Global02 and Global10) were flagged for DIF by gender. After the second step (with anchors), DIF was no longer detected for age group and gender, as the Pseudo R² change was < 0.02 for each analysis. No DIF was detected for education, region, employment, place of residence, marital status or income at all.

Convergent validity

GMH T-score showed a strong correlation with the mental health composite score of SF-36 (r_s = 0.708) and GPH T-score with the physical health composite score (r_s = 0.829) (Fig. 3). Among the SF-36 subscales, the GPH T-score had the highest correlation with general health (r_s = 0.740) and bodily pain (r_s = 0.738), while the GMH T-score showed the strongest correlation with mental health (r_s = 0.699) and vitality (r_s = 0.657).

Reference values for PROMIS-GH in Hungary

Mean total T-scores for GPH and GMH were 49.0 and 47.7, respectively (Table 1). Mean GPH and GMH T-scores of females were lower (47.8 and 46.4) compared to males (50.5 and 49.3) (p < 0.001). We found the highest mean T-scores for GPH and GMH in the 18–24 age group (GPH: 52.3 and GMH: 49.9). Mean GPH and GMH T-scores showed a decreasing trend with age (p < 0.05). Those with higher level of education, living in towns, being student, having higher income and without chronic disease had higher mean T-scores scores for both GPH and GMH (p < 0.001). With regard to BMI, mean GPH T-scores were higher in respondents with normal weight compared to those being underweight or overweight/obese (p < 0.05). Those who reported ‘excellent’ health on the first question of the SF-36 had the highest, while those who reported ‘poor’ had the lowest mean GPH and GMH T-scores (p < 0.001).

Discussion

This study provided a psychometric assessment of the Hungarian version of PROMIS-GH and developed population reference values for its physical and mental health subscales in Hungary. We used both classical test theory and IRT methods to establish the psychometric properties of the measure. PROMIS-GH subscales showed no ceiling and floor effects. All assumptions of IRT (unidimensionality, local independence and monotonicity) were met. Although the Omega Hierarchical value was below the tentative benchmark for GPH, it is important to emphasize that PROMIS-GH is inherently a multidimensional measure, and therefore, individual subscale values within the range of 0.6 and 0.8 seem appropriate both for Omega Hierarchical and ECV [36, 37]. The goodness of fit to the graded response model was acceptable with a few items misfitting. We found no measurement invariance for any sociodemographic characteristics. Strong correlations were found between corresponding PROMIS-GH subscales and SF-36 physical and mental health composite scores. Mean GPH and GMH T-scores in the Hungarian general population were 49.0 and 47.7, respectively.

It is worthwhile to compare our findings about the psychometric performance of PROMIS-GH to those of earlier psychometric studies among members of the general population in the Netherlands and the US [10, 23]. First, unidimensionality was supported with negligible deviations in each study. No local dependence was detected in the Hungarian and Dutch general population samples. The coefficients of the Mokken scale analysis showed that monotonicity was supported in the Hungarian and Dutch samples, and an interesting similarity occurred that in both studies the Global06 item had the smallest distance between the thresholds (Hungarian: − 2.879 to − 0.252; Dutch: − 2.668 to − 0.055). The range of item difficulty values (b) were very similar in all three general population studies with small differences at both ends (US: − 3.0 to 1.5, Hungarian: − 3.7 to 1.7, Dutch: − 3.7 to 1.9) [10, 23]. Ranges of item discrimination parameters (a) were similar for both subscales with slight differences between the US and Dutch studies [10, 23]. While the item discrimination parameters of the Hungarian GPH were in the same range (from 1.6 to 2.3) as the previous two, the Hungarian GMH was somewhat biased due to Global04 (from 1.7 to 8.0), as it usually ranges between 0.5 and 2.5 [31].

The Hungarian overall mean GPH and GMH T-scores (49.0 and 47.7) were slightly lower than those of the US reference population values (GPH: 50.0, GMH: 50.0) and higher than the Dutch values (GPH: 45.2, GMH: 44.7), suggesting that the Hungarian general population is in a better health status than the Dutch (Online Resource 2). By contrast, the standardized Dutch SF-36 physical (49.7) and mental health composite score (52.1) were somewhat higher than the Hungarian scores (48.3 and 48.2), implying that the Dutch general population is in a better health status [51]. However, the Dutch population norm data were collected using the SF-12 and in 1996, which may limit the comparison [52]. A similar pattern was observed for GPH and GMH in the Hungarian general population as in the US and Dutch samples, with a decreasing mean T-score with age, and males reporting better health status than females [25, 53]. However, it should be noted that the US sample (data collected in 2006–2007) and the Dutch sample (data collected in 2016) were obtained considerably earlier compared to this study. In addition, the US calibration sample may not be representative for the European populations. Ultimately, the following characteristics were associated with better physical and mental health in the Hungarian sample: being younger, male, having higher level of education, living in towns, student status, having a higher level of income, having no chronic diseases and reporting better self-perceived health on the first question of the SF-36.

A surprising finding of this study is that the Hungarian general population reported better overall health status than the Dutch general population. Life expectancy in the Netherlands is almost one year higher (81.5) than the weighted EU average (80.6), while life expectancy in Hungary is almost five years (75.7) behind the weighted EU average [54]. In terms of government funding, compulsory and voluntary health insurance and out-of-pocket payments, the Netherlands has one of the highest per capita spending on healthcare in the EU, while Hungary continues to fall behind the EU average in this regard. The greatest contrast might be in the fact that in 2019, 75% of the Dutch general public reported that they were in good health, and this figure did not reach 60% in Hungary in the same year [54]. However, the comparison of PROMIS-GH scores between these two countries is limited by the fact that the Dutch sample was not representative for some important sociodemographic and health-related characteristics of the general population, such as employment and marital status, income and the prevalence of chronic diseases [25].

This study has a few limitations. Our data were collected during the pandemic that might have influenced health status of the general population. However, a recent study has shown that the COVID-19 pandemic had negligible impact on the health status of US patients measured by PROMIS-GH [55]. Furthermore, self-reported health status on the first question of SF-36 in our study was very similar to what had been reported in a pre-COVID online general population survey in Hungary in 2019 [56]. Selection bias might have occurred as online panel data collections may be subject to possible self-selection and underrepresentation of certain groups (e.g. those without internet access) [57]. Another limitation is the cross-sectional nature of this study that prevented us from assessing test–retest reliability and responsiveness of PROMIS-GH.

In conclusion, this study provided an extensive psychometric analysis of the Hungarian PROMIS-GH in a large general population sample and established general population reference values for Hungary. Future research is recommended to replicate this general population study after the COVID-19 pandemic and further test psychometric properties of the Hungarian PROMIS-GH in paper-and-pencil surveys, longitudinal studies and with various patient populations.

Data availability

Data are available from the corresponding author upon a reasonable request.

References

Nelson, E.C., Eftimovska, E., Lind, C., Hager, A., Wasson, J.H., Lindblad, S.: Patient reported outcome measures in practice. BMJ: Br. Med. J. 350, g7818 (2015)
Article Google Scholar
Patrick, D.L., Deyo, R.A.: Generic and disease-specific measures in assessing health status and quality of life. Med. Care 27(3 Suppl), S217-232 (1989)
Article CAS PubMed Google Scholar
EuroQol Group: EuroQol—a new facility for the measurement of health-related quality of life. Health Policy 16(3), 199–208 (1990)
Ware Jr, J.E., Sherbourne, C.D.: The MOS 36-item short-form health survey (SF-36): I. conceptual framework and item selection. Med. Care 30(6), 473–483 (1992)
Cella, D., Yount, S., Rothrock, N., Gershon, R., Cook, K., Reeve, B., et al.: The patient-reported outcomes measurement information system (PROMIS): progress of an NIH Roadmap cooperative group during its first two years. Med. Care 45(5 Suppl 1), S3-s11 (2007)
Article PubMed PubMed Central Google Scholar
Cella, D., Riley, W., Stone, A., Rothrock, N., Reeve, B., Yount, S., et al.: The patient-reported outcomes measurement information system (PROMIS) developed and tested its first wave of adult self-reported health outcome item banks: 2005–2008. J. Clin. Epidemiol. 63(11), 1179–1194 (2010)
Article PubMed PubMed Central Google Scholar
Cella, D., Choi, S.W., Condon, D.M., Schalet, B., Hays, R.D., Rothrock, N.E., et al.: PROMIS® adult health profiles: efficient short-form measures of seven health domains. Value Health 22(5), 537–544 (2019)
Article PubMed PubMed Central Google Scholar
Hays, R.D., Morales, L.S., Reise, S.P.: Item response theory and health outcomes measurement in the 21st century. Med. Care 38(9 Suppl), Ii28-42 (2000)
CAS PubMed PubMed Central Google Scholar
Reeve, B.B., Hays, R.D., Bjorner, J.B., Cook, K.F., Crane, P.K., Teresi, J.A., et al.: Psychometric evaluation and calibration of health-related quality of life item banks: plans for the patient-reported outcomes measurement information system (PROMIS). Med. Care 45(5 Suppl 1), S22–S31 (2007)
Hays, R.D., Bjorner, J.B., Revicki, D.A., Spritzer, K.L., Cella, D.: Development of physical and mental health summary scores from the patient-reported outcomes measurement information system (PROMIS) global items. Qual. Life Res. 18(7), 873–880 (2009)
Article PubMed PubMed Central Google Scholar
Katzan, I.L., Lapin, B.: PROMIS GH (patient-reported outcomes measurement information system global health) scale in stroke: a validation study. Stroke 49(1), 147–154 (2018)
Article PubMed Google Scholar
Suriani, R.J., Kassam, H.F., Passarelli, N.R., Esparza, R., Kovacevic, D.: Validation of PROMIS Global-10 compared with legacy instruments in patients with shoulder instability. Shoulder Elbow 12(4), 243–252 (2020)
Article PubMed Google Scholar
Lapin, B., Davin, S., Stilphen, M., Benzel, E., Katzan, I.L.: Validation of PROMIS CATs and PROMIS global health in an interdisciplinary pain program for patients with chronic low back pain. Spine 45(4), E227-e235 (2020)
Article PubMed Google Scholar
Kahan, J.B., Kassam, H.F., Nicholson, A.D., Saad, M.A., Kovacevic, D.: Performance of PROMIS global-10 to legacy instruments in patients with lateral epicondylitis. Arthroscopy 35(3), 770–774 (2019)
Article PubMed Google Scholar
Nicholson, A.D., Kassam, H.F., Pan, S.D., Berman, J.E., Blaine, T.A., Kovacevic, D.: Performance of PROMIS global-10 compared with legacy instruments for rotator cuff disease. Am. J. Sports Med. 47(1), 181–188 (2019)
Article PubMed Google Scholar
Parker, D.J., Werth, P.M., Christensen, D.D., Jevsevar, D.S.: Differential item functioning to validate setting of delivery compatibility in PROMIS-global health. Qual. Life Res. 31(7), 2189–2200 (2022)
Article PubMed Google Scholar
D’Souza, A., Magnus, B.E., Myers, J., Dispenzieri, A., Flynn, K.E.: The use of PROMIS patient-reported outcomes (PROs) to inform light chain (AL) amyloid disease severity at diagnosis. Amyloid 27(2), 111–118 (2020)
Article CAS PubMed PubMed Central Google Scholar
IsHak, W.W., Pan, D., Steiner, A.J., Feldman, E., Mann, A., Mirocha, J., et al.: Patient-reported outcomes of quality of life, functioning, and GI/psychiatric symptom severity in patients with inflammatory bowel disease (IBD). Inflamm. Bowel Dis. 23(5), 798–803 (2017)
Article PubMed Google Scholar
Slavin, V., Gamble, J., Creedy, D.K., Fenwick, J., Pallant, J.: Measuring physical and mental health during pregnancy and postpartum in an Australian childbearing population - validation of the PROMIS global short form. BMC Pregnancy Childbirth 19(1), 370 (2019)
Article PubMed PubMed Central Google Scholar
Allen, J., Alpass, F.M., Stephens, C.V.: The sensitivity of the MOS SF-12 and PROMIS® global summary scores to adverse health events in an older cohort. Qual. Life Res. 27(8), 2207–2215 (2018)
Article PubMed Google Scholar
Shim, J., Hamilton, D.F.: Comparative responsiveness of the PROMIS-10 global health and EQ-5D questionnaires in patients undergoing total knee arthroplasty. Bone Joint J. 101(7), 832–837 (2019)
Article PubMed Google Scholar
Philipp, R., Lebherz, L., Thomalla, G., Härter, M., Appelbohm, H., Frese, M., et al.: Psychometric properties of a patient-reported outcome set in acute stroke patients. Brain Behav. 11(8), e2249 (2021)
Article PubMed PubMed Central Google Scholar
Pellicciari, L., Chiarotto, A., Giusti, E., Crins, M.H.P., Roorda, L.D., Terwee, C.B.: Psychometric properties of the patient-reported outcomes measurement information system scale v.12: global health (PROMIS-GH) in a Dutch general population. Health Qual. Life Outcomes 19(1), 226 (2021)
Article PubMed PubMed Central Google Scholar
Liu, H., Cella, D., Gershon, R., Shen, J., Morales, L.S., Riley, W., et al.: Representativeness of the patient-reported outcomes measurement information system internet panel. J. Clin. Epidemiol. 63(11), 1169–1178 (2010)
Article PubMed PubMed Central Google Scholar
Elsman, E.B.M., Roorda, L.D., Crins, M.H.P., Boers, M., Terwee, C.B.: Dutch reference values for the patient-reported outcomes measurement information system scale v.12 - global health (PROMIS-GH). J. Patient-Report. Outcomes 5(1), 38 (2021)
Article Google Scholar
Rencz, F., Janssen, M.F.: Analyzing the pain/discomfort and anxiety/depression composite domains and the meaning of discomfort in the EQ-5D: a mixed-methods study. Value Health 25(12), 2003–2016 (2022)
Rencz, F., Brodszky, V., Janssen, M.F.: A direct comparison of the measurement properties of EQ-5D-5L, PROMIS-29+2 and PROMIS Global Health instruments and EQ-5D-5L and PROPr utilities in a general population sample. Value Health (2023). https://doi.org/10.1016/j.jval.2023.02.002
Jenei, B., Bató, A., Mitev, A.Z., Brodszky, V., Rencz, F.: Hungarian PROMIS-29+2: psychometric properties and population reference values. Qual. Life Res. (2023). https://doi.org/10.1007/s11136-023-03364-7
HealthMeasures (2017). PROMIS global health scoring manual. http://www.healthmeasures.net/images/PROMIS/manuals/PROMIS_Global_Scoring_Manual.pdf. Accessed 7 Sept 2021
Brazier, J.E., Harper, R., Jones, N., O’cathain, A., Thomas, K., Usherwood, T., et al.: Validating the SF-36 health survey questionnaire: new outcome measure for primary care. BMJ 305(6846), 160–164 (1992)
Article CAS PubMed PubMed Central Google Scholar
Reeve, B.B., Fayers, P.: Applying item response theory modeling for evaluating questionnaire item and scale properties. Assess. Qual. Life Clin Trials: Methods Pract. 2, 55–73 (2005)
Article Google Scholar
Terwee, C.B., Bot, S.D.M., de Boer, M.R., van der Windt, D.A.W.M., Knol, D.L., Dekker, J., et al.: Quality criteria were proposed for measurement properties of health status questionnaires. J. Clin. Epidemiol. 60(1), 34–42 (2007)
Article PubMed Google Scholar
Rosseel, Y.: lavaan: an R package for structural equation modeling. J. Stat. Softw. 48(2), 1–36 (2012)
Hu, L.T., Bentler, P.M.: Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Struct. Equ. Model.: A Multidiscip. J. 6(1), 1–55 (1999)
Article Google Scholar
Rosseel, Y.: Lavaan: an R package for structural equation modeling and more. Version 0.5–12 (BETA). J. Stat. Softw. 48(2), 1–36 (2012)
Article Google Scholar
Reise, S.P., Scheines, R., Widaman, K.F., Haviland, M.G.: Multidimensionality and structural coefficient bias in structural equation modeling: a bifactor perspective. Educ. Psychol. Measur. 73(1), 5–26 (2013)
Article Google Scholar
Rodriguez, A., Reise, S.P., Haviland, M.G.: Applying bifactor statistical indices in the evaluation of psychological measures. J. Pers. Assess. 98(3), 223–237 (2016)
Article PubMed Google Scholar
Revelle, W.R.: psych: Procedures for psychological, psychometric, and personality research. Northwestern University, Evanston, Illinois. R package version 2.3.3 (2023). https://CRAN.R-project.org/package=psych. Accessed 17 Jun 2023
Mokken, R.J.: A theory and procedure of scale analysis: with applications in political research: De Gruyter Mouton. ISBN: 9783110813203 (2011)
van der Ark, L.A.: Mokken scale analysis in R. J. Stat. Softw. 20(11), 1–19 (2007)
Article Google Scholar
Chalmers, R.P.: mirt: a multidimensional item response theory package for the R environment. J. Stat. Softw. 48(6), 1–29 (2012)
Article Google Scholar
Samejima, F.: Estimation of latent ability using a response pattern of graded scores. Psychometrika Monogr. Suppl. 34(4, Pt. 2), 100 (1969)
Kang, T., Chen, T.T.: Performance of the generalized S-X2 item fit index for the graded response model. Asia Pac. Educ. Rev. 12(1), 89–96 (2011)
Article Google Scholar
Choi, S.W., Gibbons, L.E., Crane, P.K.: Lordif: an R package for detecting differential item functioning using iterative hybrid ordinal logistic regression/item response theory and Monte Carlo simulations. J. Stat. Softw. 39(8), 1 (2011)
Article PubMed PubMed Central Google Scholar
Crane, P.K., Gibbons, L.E., Jolley, L., van Belle, G.: Differential item functioning analysis with ordinal logistic regression techniques: DIFdetect and difwithpar. Med. Care 44(11 Suppl 3), S115–S123 (2006)
Article PubMed Google Scholar
Kopf, J., Zeileis, A., Strobl, C.: Anchor selection strategies for DIF analysis: review, assessment, and new approaches. Educ. Psychol. Measur. 75(1), 22–56 (2015)
Article PubMed Google Scholar
Swinscow, T.D.V., Campbell, M.J. (2002). Statistics at square one: Bmj London. 0727915525
Hungarian central statistical office (2016). Microcensus 2016. Available from: https://www.ksh.hu/docs/eng/xftp/idoszaki/microcensus2016/microcensus_2016_3.pdf. Accessed 7 Sept 2021.
Oosterveer, D.M., Arwert, H., Terwee, C.B., Schoones, J.W., Vlieland, T.P.M.V.: Measurement properties and interpretability of the PROMIS item banks in stroke patients: a systematic review. Qual. Life Res. 31(12), 3305–3315 (2022)
Szende, A., Németh, R.: Health-related quality of life of the Hungarian population. Orv. Hetil. 144(34), 1667–1674 (2003)
PubMed Google Scholar
Gandek, B., Ware, J.E., Aaronson, N.K., Apolone, G., Bjorner, J.B., Brazier, J.E., et al.: Cross-validation of item selection and scoring for the SF-12 health survey in nine countries: results from the IQOLA project. J. Clin. Epidemiol. 51(11), 1171–1178 (1998)
Article CAS PubMed Google Scholar
Gandek, B., Ware, J.E., Jr.: Methods for validating and norming translations of health status questionnaires: the IQOLA project approach. J. Clin. Epidemiol. 51(11), 953–959 (1998)
Article CAS PubMed Google Scholar
HealthMeasures (2021). PROMIS score cut points. https://www.healthmeasures.net/score-and-interpret/interpret-scores/promis/promis-score-cut-points, Accessed 7 Sept 2021
OECD: Health at a Glance 2021: OECD Indicators. OECD Publishing, Paris (2021). https://doi.org/10.1787/ae3016b9-en
Lapin, B.R., Tang, W.H.W., Honomichl, R., Hogue, O., Katzan, I.L.: Evidence of stability in patient-reported global health during the COVID-19 Pandemic. Value Health 24(11), 1578–1585 (2021)
Article PubMed PubMed Central Google Scholar
Rencz, F., Tamási, B., Brodszky, V., Ruzsa, G., Gulácsi, L., Péntek, M.: Did You get what you wanted? Patient satisfaction and congruence between preferred and perceived roles in medical decision making in a hungarian national survey. Value Health Reg. Issues 22, 61–67 (2020)
Article PubMed Google Scholar
Bethlehem, J.: Selection bias in web surveys. Int. Stat. Rev. 78(2), 161–188 (2010)
Article Google Scholar

Download references

Acknowledgements

The authors wish to thank Istvan Mucsi for supporting this study.

Funding

Open access funding provided by Corvinus University of Budapest. The data collection was supported by the Hungarian Academy of Sciences (MTA-PPD 462025). Alex Bató’s work was supported by the ÚNKP-21-3 New National Excellence Program of the Ministry for Innovation and Technology from the source of the National Research, Development and Innovation Fund (ÚNKP-21-3-I-SE-78). Fanni Rencz’s work was supported by the János Bolyai Research Scholarship of the Hungarian Academy of Sciences (BO/00304/21) and the New National Excellence Program of the Ministry for Innovation and Technology from the source of the National Research, Development and Innovation Fund (ÚNKP-22-5-CORVINUS-4).

Author information

Authors and Affiliations

Károly Rácz Doctoral School of Clinical Medicine, Semmelweis University, Budapest, Hungary
Alex Bató
Department of Health Policy, Corvinus University of Budapest, 8 Fővám tér, Budapest, 1093, Hungary
Alex Bató, Valentin Brodszky, Balázs Jenei & Fanni Rencz
Institute of Marketing and Communication Sciences, Corvinus University of Budapest, Budapest, Hungary
Ariel Zoltán Mitev

Authors

Alex Bató
View author publications
You can also search for this author in PubMed Google Scholar
Valentin Brodszky
View author publications
You can also search for this author in PubMed Google Scholar
Ariel Zoltán Mitev
View author publications
You can also search for this author in PubMed Google Scholar
Balázs Jenei
View author publications
You can also search for this author in PubMed Google Scholar
Fanni Rencz
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fanni Rencz.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 26 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bató, A., Brodszky, V., Mitev, A.Z. et al. Psychometric properties and general population reference values for PROMIS Global Health in Hungary. Eur J Health Econ 25, 549–562 (2024). https://doi.org/10.1007/s10198-023-01610-w

Download citation

Received: 14 October 2022
Accepted: 07 June 2023
Published: 28 June 2023
Issue Date: June 2024
DOI: https://doi.org/10.1007/s10198-023-01610-w

Keywords

JEL Classification

I10

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Psychometric properties and general population reference values for PROMIS Global Health in Hungary

Abstract

Objectives

Methods

Results

Conclusion

Similar content being viewed by others

Psychometric properties of the patient-reported outcomes measurement information system scale v1.2: global health (PROMIS-GH) in a Dutch general population

Dutch reference values for the Patient-Reported Outcomes Measurement Information System Scale v1.2 - Global Health (PROMIS-GH)

Psychometric evaluation of the Mental Health Quality of Life (MHQoL) instrument in seven European countries

Introduction

Methods

Study design and recruitment

Measures

PROMIS Global Health (PROMIS-GH)

36-item short form health survey (SF-36)

Statistical analyses

Ceiling and floor effect

Unidimensionality

Local independence

Monotonicity

IRT model fit

Measurement invariance

Convergent validity

Establishment of general population reference values

Hypotheses

Results

Sample characteristics (unweighted)

Ceiling and floor effect

Factor and IRT analysis

Unidimensionality

Local independence

Monotonicity

Model fit

Measurement invariance

Convergent validity

Reference values for PROMIS-GH in Hungary

Discussion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOCX 26 KB)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL Classification

Search

Navigation