Abstract
Objectives
Patient-Reported Outcomes Measurement Information System–Global Health (PROMIS-GH) is a widely used generic measure of health status. This study aimed to (1) assess the psychometric properties of the Hungarian PROMIS-GH and to (2) develop general population reference values in Hungary.
Methods
An online cross-sectional survey was conducted among the Hungarian adult general population (n = 1700). Respondents completed the PROMIS-GH v1.2. Unidimensionality (confirmatory factor analysis and bifactor model), local independence, monotonicity (Mokken scaling), graded response model fit, item characteristic curves and measurement invariance were examined. Spearman’s correlations were used to analyse convergent validity of PROMIS-GH subscales with SF-36v1 composites and subscales. Age- and gender-weighted T-scores were computed for the Global Physical Health (GPH) and Global Mental Health (GMH) subscales using the US item calibrations.
Results
The item response theory assumptions of unidimensionality, local independence and monotonicity were met for both subscales. The graded response model showed acceptable fit indices for both subscales. No differential item functioning was detected for any sociodemographic characteristics. GMH T-scores showed a strong correlation with SF-36 mental health composite score (rs = 0.71) and GPH T-scores with SF-36 physical health composite score (rs = 0.83). Mean GPH and GMH T-scores of females were lower (47.8 and 46.4) compared to males (50.5 and 49.3) (p < 0.001), and both mean GPH and GMH T-scores decreased with age, suggesting worse health status (p < 0.05).
Conclusion
This study established the validity and developed general population reference values for the PROMIS-GH in Hungary. Population reference values facilitate the interpretation of patients’ scores and allow inter-country comparisons.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Health status measures are widely used in clinical practice, observational studies, clinical trials, monitoring general population health, assessing the performance of health systems and in cost-effectiveness analysis [1]. Two forms of health status measures can be distinguished as follows: condition-specific and generic [2]. Condition-specific measures have a specific target population and are able to capture a wide range of symptoms and health problems relevant to a certain condition (e.g. itching in skin diseases or bowel problems in gastrointestinal diseases). In contrast, generic health status measures incorporate health areas that are relevant across different patient populations as well as for the general public (e.g. physical functioning, pain, sleeping). These measures have the advantage of allowing comparisons across different conditions, health interventions and with general population reference values.
In general, a large number of items are needed to precisely assess one’s health status; however, this may lack practical considerations (e.g. time and respondent burden). Therefore, short-form health assessments have gained popularity. Commonly used short generic health status measures include the EQ-5D and SF-36 [3, 4]. These instruments, however, were developed decades ago and one of their common criticisms is that their item development and selection did not benefit from modern psychometric methods, such as item response theory (IRT). The Patient Reported Outcomes Measurement Information System (PROMIS) initiative, funded by the National Institutes of Health in the US, aimed to develop, validate and standardize item banks to measure health outcomes across a broad range of health areas [5]. In the past two decades, over 100 PROMIS item banks and a few fixed-length short-forms have been developed using IRT methods (e.g. PROMIS Global Health, PROMIS-29, PROMIS-43, PROMIS-57) [6, 7]. The main advantages of IRT over classical test theory methods include the estimation of the respondents’ location on an underlying ‘latent’ trait (e.g. health status) based on any subset of items that do not vary depending on the characteristics of the population and the possibility to adaptively assess health status using computerised adaptive testing [8, 9].
PROMIS Global Health (PROMIS-GH) is the shortest PROMIS short-form that measures five generic domains of health (physical functioning, pain, fatigue, emotional distress, social health) using 10 global health items [10]. Its validity, reliability and responsiveness have been confirmed in several populations, including patients with stroke [11], orthopaedic conditions [12,13,14,15,16], amyloidosis [17], inflammatory bowel diseases [18], pregnant women [19] and older adults [20]. The international use of PROMIS-GH has also been expanding outside the US, including studies from the UK [21], Germany [22] and the Netherlands [23]. Furthermore, two countries, the US and the Netherlands have also established general population reference values [24, 25]. So far, PROMIS-GH has not been used in Hungary. Therefore, this study aimed to evaluate the psychometric performance of the Hungarian PROMIS-GH and to develop general population reference values in Hungary.
Methods
Study design and recruitment
The study was approved by the Research Ethics Committee of the Corvinus University of Budapest (no. KRH/343/2020). In November 2020, an online cross-sectional survey was conducted among the Hungarian adult general population. Respondents were recruited by a survey company from members of the largest Hungarian online panel. ‘Soft quotas’ were used for age, gender, education, place of living and geographical region to approximate the distribution of the general population. The inclusion criteria for this study were as follows: (i) ≥ 18 years of age; (ii) place of residence in Hungary; and (iii) giving informed consent prior to data collection.
PROMIS-GH v1.2 was administered as part of a longer survey that aimed to assess the health status and well-being among members of the general public in Hungary [26,27,28]. Respondents were also asked to complete the SF-36v1 and to identify their sociodemographic background (gender, age, education, place of residence, region, employment, household’s net monthly income, marital status, body weight and height) and if they had any chronic health conditions. All respondents first completed SF-36, followed by PROMIS-GH.
Measures
PROMIS Global Health (PROMIS-GH)
The official Hungarian version of PROMIS-GH v1.2 was used as provided by the PROMIS Health Organization. PROMIS-GH consists of 10 items, namely Global01 = general health, Global02 = quality of life, Global03 = physical health, Global04 = mental health, Global05 = satisfaction with discretionary social activities, Global06 = physical function, Global07 = pain, Global08 = fatigue, Global09 = social roles and Global10 = emotional problems [10]. It has two subscales, Global Physical Health (GPH) and Global Mental Health (GMH). GPH consists of Global03, Global06, Global07 and Global08, while GMH includes Global02, Global04, Global05 and Global10. The recall period of the items varies across ‘in general’, the ‘past seven days’ and unspecified. Each item is assessed on a scale with five response levels. For Global01, Global02, Global03, Global04, Global05 and Global09, the best response option is excellent (5), and the worst is poor (1). For Global06 options range from completely (5) to not at all (1), for Global10 from never (5) to always (1) and for Global08 from none (5) to very severe (1). An exception is Global07, which is rated from 0 to 10 (0 = no pain, 10 = worst imaginable pain). We recoded Global07 to a 5-point scale as follows: 0 = 5; 1–3 = 4; 4–6 = 3; 7–9 = 2; 10 = 1 [10]. Raw subscale scores were calculated by adding scores of individual items per subscale. We calculated standardized T-scores from raw scores using the US item calibrations [29]. Mean T-scores therefore represent the mean of the US general population. A higher T-score indicates better health status and a lower T-score refers to worse health status compared to the US general population, where the general population mean is set at 50 with a standard deviation of 10 [24].
36-item short form health survey (SF-36)
The Hungarian version of the SF-36v1 questionnaire was used in our survey with a 4-week recall period. SF-36 is a generic health status measure with 36 items that cover eight health subscales, specifically (1) physical functioning, (2) role limitations due to physical problems (3) bodily pain, (4) general health, (5) vitality, (6) social functioning, (7) role limitations due to emotional problems and (8) mental health [4, 30]. Responses to items are transformed to range from 0 to 100, where higher scores represent better health status. Subscale scores are computed by averaging the respective item scores. SF-36 allows the generation of two summary scores, one for physical health (physical health composite) that includes the first four subscales (1–4) and the other for mental health (mental health composite) including the last four (5–8).
Statistical analyses
In this study, we built on the methods used in earlier psychometric investigations and reference population studies with PROMIS instruments [9, 10, 23, 25]. Data analysis was carried out in R Statistical Software (v4.1.2 Vienna, Austria). We used both classical test theory (e.g. ceiling and floor effect, convergent validity, factor analysis) and IRT methods. Before IRT modelling, we tested the following three assumptions: unidimensionality, local independence and monotonicity [31]. In addition, differential item functioning (DIF) analysis was used to examine measurement invariance. Raw item and subscale scores were used to analyse ceiling and floor effect and for the factor analysis, IRT and DIF analyses. Unweighted T-scores were used to draw histograms and estimate correlations. T-scores were weighted for age group and gender to calculate Hungarian GPH and GMH general population reference values.
Ceiling and floor effect
Ceiling and floor effect were considered if GPH and GMH raw subscale scores exceeded 15% [32].
Unidimensionality
Unidimensionality was tested using confirmatory factor analysis (CFA) and bifactor models. CFA was conducted for the two subscales separately (lavaan package) [33]. Goodness-of-fit was evaluated by the comparative fit index (CFI, cut-off value: > 0.95), Tucker-Lewis index (TLI, cut-off value: > 0.95), the root mean square error of approximation (RMSEA, cut-off value < 0.06), and the standardized root mean squared residual (SRMR, cut-off value: < 0.08) [34, 35]. Further, we used bifactor models to obtain Omega Hierarchical (tentative benchmark > 0.70) and explained common variance (ECV, tentative benchmark > 0.60) [36, 37]. The bifactor models were developed using the psych package [38].
Local independence
To test local independence, we examined the residual correlation matrix resulting from the CFA for both GPH and GMH subscales. Residual correlation values between − 0.20 and 0.20 were considered acceptable supporting local independence [9].
Monotonicity
Monotonicity was analysed using Mokken scale analysis (mokken package). Coefficients (Hi for items, H for subscales) exceeding the cut-off value of > 0.30 were considered acceptable [39, 40].
IRT model fit
Given the polytomous response options of PROMIS-GH items, a graded response model was fitted for both GPH and GMH (mirt package) [41, 42]. To detect item misfit, we used Orlando and Thissen’s S-χ2. Items with p-value < 0.001 were considered misfitting [43]. The same cut-off values were used for fit indices (CFI, TLI, RMSEA, SRMR) as for unidimensionality [34]. Item discrimination (slope, a) and item difficulties (threshold, b) were also computed. Item characteristic curves (ICC) were generated for each item of the two subscales.
Measurement invariance
Measurement invariance was assessed by analysing differential item functioning (DIF) using the lordif package [44]. DIF occurs when the responses of a subgroup of respondents on an item consistently differ from those of another subgroup when controlling for the underlying level of the trait measured by the scale [9]. DIF was analysed for GPH and GMH with the following subgroups: gender (female, male), median age (< 47, ≥ 47 years), education (primary, secondary, tertiary), region (Central, Western and Eastern Hungary), employment (employed, not employed), place of residence (capital, other town, village), marital status (married, not married), and income groups (quintiles, do not know, refused to answer groups). First, we used ordinal logistic regression models without an anchor to evaluate DIF. Where DIF was detected, we repeated the analysis using non-DIF items as an anchor. A Pseudo R2 change ≥ 0.02 was taken as a critical value [45, 46]. The details of the DIF analysis are provided elsewhere [28].
Convergent validity
Spearman’s rank order correlations were used to explore the convergent validity of the two PROMIS-GH subscales with the eight SF-36 subscales and two composite scores. Correlation coefficients (rs) were interpreted as very weak (< 0.20), weak (0.20–0.39), moderate (0.40–0.59) and strong correlation (0.60 ≤) [47].
Establishment of general population reference values
Mean GPH and GMH T-scores were weighted according to gender and age group to derive general population reference values using the US item calibrations [48]. Mean weighted T-scores were computed for subgroups of respondents defined by gender, age groups, education, place of residence, region, employment, income groups, marital status, health status question of SF-36 (item 1), BMI and the presence of any chronic condition. We used Taylor linearization for standard errors, and 95% confidence intervals were calculated for each group. The subgroups were compared using Mann–Whitney or Kruskal–Wallis tests, where applicable.
Hypotheses
Regarding the psychometric properties, we hypothesized (1) no ceiling or floor effects for any subscales, (2) unidimensionality, (3) local independence, (4) monotonicity, (5) acceptable fit to the graded response model, (6) no measurement invariance for any subgroups, (7) moderate or strong correlations between the PROMIS-GH subscales (GPH and GMH) and their corresponding SF-36 composite scores [10, 11, 23, 49]. With regard to the reference values, we hypothesized better self-reported health in men and declining physical health with age [50].
Results
Sample characteristics (unweighted)
Overall, 2502 respondents initiated the survey, 2079 of whom consented and 379 quit before the end of the questionnaire. A total of 1700 respondents completed the survey. The mean age was 47.9 ± 16.3 years, and 56.3% of the respondents were female. Nearly one-third of the sample had tertiary education (32.4%). Half of the respondents were employed (50.9%), 23.5% were retired and 4.4% were students. Overall, 22.4% lived in the capital, 48.2% in other towns and 29.4% in villages. The geographical distribution of the sample was as follows: Western Hungary 29.0%, Central Hungary 33.6%, Eastern Hungary 37.4%. Overall, 67.4% of the sample reported to have any chronic disease. The overall sample showed a good representativeness for the general population in Hungary; however, respondents with a secondary education were slightly underrepresented and those who lived in the capital were somewhat overrepresented (Table 1).
Ceiling and floor effect
The distributions of GPH and GMH raw scores are presented in Fig. 1. We found almost no floor and low ceiling effect for both GPH (0.4% and 4.1%) and GMH subscales (0.5% and 4.8%) (Table 2). Among the items, Global07 demonstrated the highest floor (29.8%). Global06 showed the highest ceiling (58.2%), followed by Global10 (38.3%), Global08 (23.9%) and Global09 (15.8%).
Factor and IRT analysis
Unidimensionality
Fit indices confirmed the unidimensionality of both GPH (CFI = 0.993, TLI = 0.978, SRMR = 0.039) and GMH (CFI = 0.999, TLI = 0.997, SRMR = 0.025), with the exception of RMSEA (GPH 0.114 and GMH 0.071). The hypotheses were supported by the bifactor models, resulting in ECV values higher than the tentative benchmark for both subscales (GPH 0.72 and GMH 0.78). Omega Hierarchical was above the tentative benchmark only for GMH (0.73), but not for GPH (0.66) (Table 3).
Local independence
We found no local dependence between item pairs (Online Resource 1). Eight item pairs had negative residual correlations, but all values were above the value of − 0.20.
Monotonicity
The Mokken scale analysis resulted in coefficients higher than the cut-off value for both subscales (H = 0.531 and 0.638 for GPH and GMH) and items, ranging from Hi = 0.480 (Global08) to 0.717 (Global04) supporting monotonicity (Table 3).
Model fit
Given that unidimensionality, local independence and monotonicity were supported for both subscales, graded response models were fitted. Acceptable fit indices were found for both subscales (GPH: RMSEA = 0.008, SRMR = 0.045, TLI = 0.905, CFI = 0.968 and GMH: RMSEA = 0.012, SRMR = 0.031, TLI = 0.969, CFI = 0.990). A few items showed misfit to the graded response model, namely Global03, Global06, Global02, Global05 and Global10 (p < 0.001) (Table 3). Item difficulties (b) ranged from − 3.7 (Global08) to 1.7 (Global03) for GPH and from − 2.9 (Global10) to 1.7 (Global02) for GMH. Item discrimination (a) values ranged from 1.6 (Global08) to 2.3 (Global07) and from 1.7 (Global10) to 8.0 (Global04) for GPH and GMH, respectively. ICCs for the two subscales are displayed in Fig. 2.
Measurement invariance
After the first step (without anchors), one item (Global07) was flagged for DIF based on age groups, and two items (Global02 and Global10) were flagged for DIF by gender. After the second step (with anchors), DIF was no longer detected for age group and gender, as the Pseudo R2 change was < 0.02 for each analysis. No DIF was detected for education, region, employment, place of residence, marital status or income at all.
Convergent validity
GMH T-score showed a strong correlation with the mental health composite score of SF-36 (rs = 0.708) and GPH T-score with the physical health composite score (rs = 0.829) (Fig. 3). Among the SF-36 subscales, the GPH T-score had the highest correlation with general health (rs = 0.740) and bodily pain (rs = 0.738), while the GMH T-score showed the strongest correlation with mental health (rs = 0.699) and vitality (rs = 0.657).
Reference values for PROMIS-GH in Hungary
Mean total T-scores for GPH and GMH were 49.0 and 47.7, respectively (Table 1). Mean GPH and GMH T-scores of females were lower (47.8 and 46.4) compared to males (50.5 and 49.3) (p < 0.001). We found the highest mean T-scores for GPH and GMH in the 18–24 age group (GPH: 52.3 and GMH: 49.9). Mean GPH and GMH T-scores showed a decreasing trend with age (p < 0.05). Those with higher level of education, living in towns, being student, having higher income and without chronic disease had higher mean T-scores scores for both GPH and GMH (p < 0.001). With regard to BMI, mean GPH T-scores were higher in respondents with normal weight compared to those being underweight or overweight/obese (p < 0.05). Those who reported ‘excellent’ health on the first question of the SF-36 had the highest, while those who reported ‘poor’ had the lowest mean GPH and GMH T-scores (p < 0.001).
Discussion
This study provided a psychometric assessment of the Hungarian version of PROMIS-GH and developed population reference values for its physical and mental health subscales in Hungary. We used both classical test theory and IRT methods to establish the psychometric properties of the measure. PROMIS-GH subscales showed no ceiling and floor effects. All assumptions of IRT (unidimensionality, local independence and monotonicity) were met. Although the Omega Hierarchical value was below the tentative benchmark for GPH, it is important to emphasize that PROMIS-GH is inherently a multidimensional measure, and therefore, individual subscale values within the range of 0.6 and 0.8 seem appropriate both for Omega Hierarchical and ECV [36, 37]. The goodness of fit to the graded response model was acceptable with a few items misfitting. We found no measurement invariance for any sociodemographic characteristics. Strong correlations were found between corresponding PROMIS-GH subscales and SF-36 physical and mental health composite scores. Mean GPH and GMH T-scores in the Hungarian general population were 49.0 and 47.7, respectively.
It is worthwhile to compare our findings about the psychometric performance of PROMIS-GH to those of earlier psychometric studies among members of the general population in the Netherlands and the US [10, 23]. First, unidimensionality was supported with negligible deviations in each study. No local dependence was detected in the Hungarian and Dutch general population samples. The coefficients of the Mokken scale analysis showed that monotonicity was supported in the Hungarian and Dutch samples, and an interesting similarity occurred that in both studies the Global06 item had the smallest distance between the thresholds (Hungarian: − 2.879 to − 0.252; Dutch: − 2.668 to − 0.055). The range of item difficulty values (b) were very similar in all three general population studies with small differences at both ends (US: − 3.0 to 1.5, Hungarian: − 3.7 to 1.7, Dutch: − 3.7 to 1.9) [10, 23]. Ranges of item discrimination parameters (a) were similar for both subscales with slight differences between the US and Dutch studies [10, 23]. While the item discrimination parameters of the Hungarian GPH were in the same range (from 1.6 to 2.3) as the previous two, the Hungarian GMH was somewhat biased due to Global04 (from 1.7 to 8.0), as it usually ranges between 0.5 and 2.5 [31].
The Hungarian overall mean GPH and GMH T-scores (49.0 and 47.7) were slightly lower than those of the US reference population values (GPH: 50.0, GMH: 50.0) and higher than the Dutch values (GPH: 45.2, GMH: 44.7), suggesting that the Hungarian general population is in a better health status than the Dutch (Online Resource 2). By contrast, the standardized Dutch SF-36 physical (49.7) and mental health composite score (52.1) were somewhat higher than the Hungarian scores (48.3 and 48.2), implying that the Dutch general population is in a better health status [51]. However, the Dutch population norm data were collected using the SF-12 and in 1996, which may limit the comparison [52]. A similar pattern was observed for GPH and GMH in the Hungarian general population as in the US and Dutch samples, with a decreasing mean T-score with age, and males reporting better health status than females [25, 53]. However, it should be noted that the US sample (data collected in 2006–2007) and the Dutch sample (data collected in 2016) were obtained considerably earlier compared to this study. In addition, the US calibration sample may not be representative for the European populations. Ultimately, the following characteristics were associated with better physical and mental health in the Hungarian sample: being younger, male, having higher level of education, living in towns, student status, having a higher level of income, having no chronic diseases and reporting better self-perceived health on the first question of the SF-36.
A surprising finding of this study is that the Hungarian general population reported better overall health status than the Dutch general population. Life expectancy in the Netherlands is almost one year higher (81.5) than the weighted EU average (80.6), while life expectancy in Hungary is almost five years (75.7) behind the weighted EU average [54]. In terms of government funding, compulsory and voluntary health insurance and out-of-pocket payments, the Netherlands has one of the highest per capita spending on healthcare in the EU, while Hungary continues to fall behind the EU average in this regard. The greatest contrast might be in the fact that in 2019, 75% of the Dutch general public reported that they were in good health, and this figure did not reach 60% in Hungary in the same year [54]. However, the comparison of PROMIS-GH scores between these two countries is limited by the fact that the Dutch sample was not representative for some important sociodemographic and health-related characteristics of the general population, such as employment and marital status, income and the prevalence of chronic diseases [25].
This study has a few limitations. Our data were collected during the pandemic that might have influenced health status of the general population. However, a recent study has shown that the COVID-19 pandemic had negligible impact on the health status of US patients measured by PROMIS-GH [55]. Furthermore, self-reported health status on the first question of SF-36 in our study was very similar to what had been reported in a pre-COVID online general population survey in Hungary in 2019 [56]. Selection bias might have occurred as online panel data collections may be subject to possible self-selection and underrepresentation of certain groups (e.g. those without internet access) [57]. Another limitation is the cross-sectional nature of this study that prevented us from assessing test–retest reliability and responsiveness of PROMIS-GH.
In conclusion, this study provided an extensive psychometric analysis of the Hungarian PROMIS-GH in a large general population sample and established general population reference values for Hungary. Future research is recommended to replicate this general population study after the COVID-19 pandemic and further test psychometric properties of the Hungarian PROMIS-GH in paper-and-pencil surveys, longitudinal studies and with various patient populations.
Data availability
Data are available from the corresponding author upon a reasonable request.
References
Nelson, E.C., Eftimovska, E., Lind, C., Hager, A., Wasson, J.H., Lindblad, S.: Patient reported outcome measures in practice. BMJ: Br. Med. J. 350, g7818 (2015)
Patrick, D.L., Deyo, R.A.: Generic and disease-specific measures in assessing health status and quality of life. Med. Care 27(3 Suppl), S217-232 (1989)
EuroQol Group: EuroQol—a new facility for the measurement of health-related quality of life. Health Policy 16(3), 199–208 (1990)
Ware Jr, J.E., Sherbourne, C.D.: The MOS 36-item short-form health survey (SF-36): I. conceptual framework and item selection. Med. Care 30(6), 473–483 (1992)
Cella, D., Yount, S., Rothrock, N., Gershon, R., Cook, K., Reeve, B., et al.: The patient-reported outcomes measurement information system (PROMIS): progress of an NIH Roadmap cooperative group during its first two years. Med. Care 45(5 Suppl 1), S3-s11 (2007)
Cella, D., Riley, W., Stone, A., Rothrock, N., Reeve, B., Yount, S., et al.: The patient-reported outcomes measurement information system (PROMIS) developed and tested its first wave of adult self-reported health outcome item banks: 2005–2008. J. Clin. Epidemiol. 63(11), 1179–1194 (2010)
Cella, D., Choi, S.W., Condon, D.M., Schalet, B., Hays, R.D., Rothrock, N.E., et al.: PROMIS® adult health profiles: efficient short-form measures of seven health domains. Value Health 22(5), 537–544 (2019)
Hays, R.D., Morales, L.S., Reise, S.P.: Item response theory and health outcomes measurement in the 21st century. Med. Care 38(9 Suppl), Ii28-42 (2000)
Reeve, B.B., Hays, R.D., Bjorner, J.B., Cook, K.F., Crane, P.K., Teresi, J.A., et al.: Psychometric evaluation and calibration of health-related quality of life item banks: plans for the patient-reported outcomes measurement information system (PROMIS). Med. Care 45(5 Suppl 1), S22–S31 (2007)
Hays, R.D., Bjorner, J.B., Revicki, D.A., Spritzer, K.L., Cella, D.: Development of physical and mental health summary scores from the patient-reported outcomes measurement information system (PROMIS) global items. Qual. Life Res. 18(7), 873–880 (2009)
Katzan, I.L., Lapin, B.: PROMIS GH (patient-reported outcomes measurement information system global health) scale in stroke: a validation study. Stroke 49(1), 147–154 (2018)
Suriani, R.J., Kassam, H.F., Passarelli, N.R., Esparza, R., Kovacevic, D.: Validation of PROMIS Global-10 compared with legacy instruments in patients with shoulder instability. Shoulder Elbow 12(4), 243–252 (2020)
Lapin, B., Davin, S., Stilphen, M., Benzel, E., Katzan, I.L.: Validation of PROMIS CATs and PROMIS global health in an interdisciplinary pain program for patients with chronic low back pain. Spine 45(4), E227-e235 (2020)
Kahan, J.B., Kassam, H.F., Nicholson, A.D., Saad, M.A., Kovacevic, D.: Performance of PROMIS global-10 to legacy instruments in patients with lateral epicondylitis. Arthroscopy 35(3), 770–774 (2019)
Nicholson, A.D., Kassam, H.F., Pan, S.D., Berman, J.E., Blaine, T.A., Kovacevic, D.: Performance of PROMIS global-10 compared with legacy instruments for rotator cuff disease. Am. J. Sports Med. 47(1), 181–188 (2019)
Parker, D.J., Werth, P.M., Christensen, D.D., Jevsevar, D.S.: Differential item functioning to validate setting of delivery compatibility in PROMIS-global health. Qual. Life Res. 31(7), 2189–2200 (2022)
D’Souza, A., Magnus, B.E., Myers, J., Dispenzieri, A., Flynn, K.E.: The use of PROMIS patient-reported outcomes (PROs) to inform light chain (AL) amyloid disease severity at diagnosis. Amyloid 27(2), 111–118 (2020)
IsHak, W.W., Pan, D., Steiner, A.J., Feldman, E., Mann, A., Mirocha, J., et al.: Patient-reported outcomes of quality of life, functioning, and GI/psychiatric symptom severity in patients with inflammatory bowel disease (IBD). Inflamm. Bowel Dis. 23(5), 798–803 (2017)
Slavin, V., Gamble, J., Creedy, D.K., Fenwick, J., Pallant, J.: Measuring physical and mental health during pregnancy and postpartum in an Australian childbearing population - validation of the PROMIS global short form. BMC Pregnancy Childbirth 19(1), 370 (2019)
Allen, J., Alpass, F.M., Stephens, C.V.: The sensitivity of the MOS SF-12 and PROMIS® global summary scores to adverse health events in an older cohort. Qual. Life Res. 27(8), 2207–2215 (2018)
Shim, J., Hamilton, D.F.: Comparative responsiveness of the PROMIS-10 global health and EQ-5D questionnaires in patients undergoing total knee arthroplasty. Bone Joint J. 101(7), 832–837 (2019)
Philipp, R., Lebherz, L., Thomalla, G., Härter, M., Appelbohm, H., Frese, M., et al.: Psychometric properties of a patient-reported outcome set in acute stroke patients. Brain Behav. 11(8), e2249 (2021)
Pellicciari, L., Chiarotto, A., Giusti, E., Crins, M.H.P., Roorda, L.D., Terwee, C.B.: Psychometric properties of the patient-reported outcomes measurement information system scale v.12: global health (PROMIS-GH) in a Dutch general population. Health Qual. Life Outcomes 19(1), 226 (2021)
Liu, H., Cella, D., Gershon, R., Shen, J., Morales, L.S., Riley, W., et al.: Representativeness of the patient-reported outcomes measurement information system internet panel. J. Clin. Epidemiol. 63(11), 1169–1178 (2010)
Elsman, E.B.M., Roorda, L.D., Crins, M.H.P., Boers, M., Terwee, C.B.: Dutch reference values for the patient-reported outcomes measurement information system scale v.12 - global health (PROMIS-GH). J. Patient-Report. Outcomes 5(1), 38 (2021)
Rencz, F., Janssen, M.F.: Analyzing the pain/discomfort and anxiety/depression composite domains and the meaning of discomfort in the EQ-5D: a mixed-methods study. Value Health 25(12), 2003–2016 (2022)
Rencz, F., Brodszky, V., Janssen, M.F.: A direct comparison of the measurement properties of EQ-5D-5L, PROMIS-29+2 and PROMIS Global Health instruments and EQ-5D-5L and PROPr utilities in a general population sample. Value Health (2023). https://doi.org/10.1016/j.jval.2023.02.002
Jenei, B., Bató, A., Mitev, A.Z., Brodszky, V., Rencz, F.: Hungarian PROMIS-29+2: psychometric properties and population reference values. Qual. Life Res. (2023). https://doi.org/10.1007/s11136-023-03364-7
HealthMeasures (2017). PROMIS global health scoring manual. http://www.healthmeasures.net/images/PROMIS/manuals/PROMIS_Global_Scoring_Manual.pdf. Accessed 7 Sept 2021
Brazier, J.E., Harper, R., Jones, N., O’cathain, A., Thomas, K., Usherwood, T., et al.: Validating the SF-36 health survey questionnaire: new outcome measure for primary care. BMJ 305(6846), 160–164 (1992)
Reeve, B.B., Fayers, P.: Applying item response theory modeling for evaluating questionnaire item and scale properties. Assess. Qual. Life Clin Trials: Methods Pract. 2, 55–73 (2005)
Terwee, C.B., Bot, S.D.M., de Boer, M.R., van der Windt, D.A.W.M., Knol, D.L., Dekker, J., et al.: Quality criteria were proposed for measurement properties of health status questionnaires. J. Clin. Epidemiol. 60(1), 34–42 (2007)
Rosseel, Y.: lavaan: an R package for structural equation modeling. J. Stat. Softw. 48(2), 1–36 (2012)
Hu, L.T., Bentler, P.M.: Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Struct. Equ. Model.: A Multidiscip. J. 6(1), 1–55 (1999)
Rosseel, Y.: Lavaan: an R package for structural equation modeling and more. Version 0.5–12 (BETA). J. Stat. Softw. 48(2), 1–36 (2012)
Reise, S.P., Scheines, R., Widaman, K.F., Haviland, M.G.: Multidimensionality and structural coefficient bias in structural equation modeling: a bifactor perspective. Educ. Psychol. Measur. 73(1), 5–26 (2013)
Rodriguez, A., Reise, S.P., Haviland, M.G.: Applying bifactor statistical indices in the evaluation of psychological measures. J. Pers. Assess. 98(3), 223–237 (2016)
Revelle, W.R.: psych: Procedures for psychological, psychometric, and personality research. Northwestern University, Evanston, Illinois. R package version 2.3.3 (2023). https://CRAN.R-project.org/package=psych. Accessed 17 Jun 2023
Mokken, R.J.: A theory and procedure of scale analysis: with applications in political research: De Gruyter Mouton. ISBN: 9783110813203 (2011)
van der Ark, L.A.: Mokken scale analysis in R. J. Stat. Softw. 20(11), 1–19 (2007)
Chalmers, R.P.: mirt: a multidimensional item response theory package for the R environment. J. Stat. Softw. 48(6), 1–29 (2012)
Samejima, F.: Estimation of latent ability using a response pattern of graded scores. Psychometrika Monogr. Suppl. 34(4, Pt. 2), 100 (1969)
Kang, T., Chen, T.T.: Performance of the generalized S-X2 item fit index for the graded response model. Asia Pac. Educ. Rev. 12(1), 89–96 (2011)
Choi, S.W., Gibbons, L.E., Crane, P.K.: Lordif: an R package for detecting differential item functioning using iterative hybrid ordinal logistic regression/item response theory and Monte Carlo simulations. J. Stat. Softw. 39(8), 1 (2011)
Crane, P.K., Gibbons, L.E., Jolley, L., van Belle, G.: Differential item functioning analysis with ordinal logistic regression techniques: DIFdetect and difwithpar. Med. Care 44(11 Suppl 3), S115–S123 (2006)
Kopf, J., Zeileis, A., Strobl, C.: Anchor selection strategies for DIF analysis: review, assessment, and new approaches. Educ. Psychol. Measur. 75(1), 22–56 (2015)
Swinscow, T.D.V., Campbell, M.J. (2002). Statistics at square one: Bmj London. 0727915525
Hungarian central statistical office (2016). Microcensus 2016. Available from: https://www.ksh.hu/docs/eng/xftp/idoszaki/microcensus2016/microcensus_2016_3.pdf. Accessed 7 Sept 2021.
Oosterveer, D.M., Arwert, H., Terwee, C.B., Schoones, J.W., Vlieland, T.P.M.V.: Measurement properties and interpretability of the PROMIS item banks in stroke patients: a systematic review. Qual. Life Res. 31(12), 3305–3315 (2022)
Szende, A., Németh, R.: Health-related quality of life of the Hungarian population. Orv. Hetil. 144(34), 1667–1674 (2003)
Gandek, B., Ware, J.E., Aaronson, N.K., Apolone, G., Bjorner, J.B., Brazier, J.E., et al.: Cross-validation of item selection and scoring for the SF-12 health survey in nine countries: results from the IQOLA project. J. Clin. Epidemiol. 51(11), 1171–1178 (1998)
Gandek, B., Ware, J.E., Jr.: Methods for validating and norming translations of health status questionnaires: the IQOLA project approach. J. Clin. Epidemiol. 51(11), 953–959 (1998)
HealthMeasures (2021). PROMIS score cut points. https://www.healthmeasures.net/score-and-interpret/interpret-scores/promis/promis-score-cut-points, Accessed 7 Sept 2021
OECD: Health at a Glance 2021: OECD Indicators. OECD Publishing, Paris (2021). https://doi.org/10.1787/ae3016b9-en
Lapin, B.R., Tang, W.H.W., Honomichl, R., Hogue, O., Katzan, I.L.: Evidence of stability in patient-reported global health during the COVID-19 Pandemic. Value Health 24(11), 1578–1585 (2021)
Rencz, F., Tamási, B., Brodszky, V., Ruzsa, G., Gulácsi, L., Péntek, M.: Did You get what you wanted? Patient satisfaction and congruence between preferred and perceived roles in medical decision making in a hungarian national survey. Value Health Reg. Issues 22, 61–67 (2020)
Bethlehem, J.: Selection bias in web surveys. Int. Stat. Rev. 78(2), 161–188 (2010)
Acknowledgements
The authors wish to thank Istvan Mucsi for supporting this study.
Funding
Open access funding provided by Corvinus University of Budapest. The data collection was supported by the Hungarian Academy of Sciences (MTA-PPD 462025). Alex Bató’s work was supported by the ÚNKP-21-3 New National Excellence Program of the Ministry for Innovation and Technology from the source of the National Research, Development and Innovation Fund (ÚNKP-21-3-I-SE-78). Fanni Rencz’s work was supported by the János Bolyai Research Scholarship of the Hungarian Academy of Sciences (BO/00304/21) and the New National Excellence Program of the Ministry for Innovation and Technology from the source of the National Research, Development and Innovation Fund (ÚNKP-22-5-CORVINUS-4).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Bató, A., Brodszky, V., Mitev, A.Z. et al. Psychometric properties and general population reference values for PROMIS Global Health in Hungary. Eur J Health Econ 25, 549–562 (2024). https://doi.org/10.1007/s10198-023-01610-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10198-023-01610-w