Abstract
Introduction
Validated patient report tools for quantifying patient experiences of itch in prurigo nodularis (PN) are limited. This study aimed to evaluate the validity of the 11-point peak pruritus numerical rating scale (PP NRS) as a single-item patient-reported outcome (PRO) measure for assessing itch severity in PN.
Methods
Content validity of the PP NRS was evaluated through qualitative interviews with adults with PN. The PP NRS was then psychometrically evaluated using data from a placebo-controlled trial of nemolizumab in adults with PN, during which patients completed the PP NRS daily. Meaningful within-patient change was estimated from the qualitative interviews and by anchor- and distribution-based analyses of trial data.
Results
The interview participants (N = 21) all understood the PP NRS and reported itch as their worst symptom overall. The PP NRS showed good test–retest reliability and demonstrated convergent validity and known-groups validity. PP NRS scores improved more in patients classified as “improved” on other clinical outcome measures than in those classified as “worsened/unchanged”. Triangulation of the different estimates identified a 2- to 5-point decrease in PP NRS score as a meaningful within-patient change threshold.
Conclusion
The PP NRS is a content-valid and reliable PRO measure for quantifying itch severity in adults with PN in clinical trials.
Trial Registration Number
NCT03181503.
Plain Language Summary
Prurigo nodularis is a skin disease where the skin becomes inflamed and very itchy. Itching and scratching can ruin the lives of people with prurigo nodularis. For doctors to know how well treatments for prurigo nodularis are working, they need to be able measure how bad their patients’ itching is. Here we tested whether a tool called the peak pruritus numeric rating scale (PP NRS) can be used to reliably measure itching in patients with prurigo nodularis. The PP NRS asks patients to rate their itch “at the worst moment during the previous 24 h” on a scale from 0 (“no itch”) to 10 (“worst itch imaginable”). We interviewed 21 patients with prurigo nodularis. Overall, itch was their worst symptom. They were able to use the PP NRS with few or no problems. To further explore whether the PP NRS can be used by patients with prurigo nodularis, we also obtained data from a clinical study. In this clinical study, patients with prurigo nodularis received a drug that might one day be used to treat their disease. They completed the PP NRS daily during the time they received the drug. When we analyzed the data from the clinical study, we found that the PP NRS has the properties a tool like this needs to be useful. We also concluded that a decrease of 2–5 points on the PP NRS scale would be a meaningful improvement in itching for patients with prurigo nodularis.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Why carry out this study? |
Patients with prurigo nodularis (PN) have impaired quality of life due to severe itch, but options for quantifying their experiences of itch are limited. |
This study aimed to validate the peak pruritus numeric rating scale (PP NRS), a single-item patient-reported outcome measure for assessing maximum itch severity, in patients with PN. |
What was learned from the study? |
Adults with PN who participated in qualitative interviews reported that itch was their worst symptom overall and demonstrated good understanding of the PP NRS. |
In a psychometric evaluation using data from a phase 2 trial in adults with PN, the PP NRS showed good reliability, validity, and responsiveness. |
This study confirms that the PP NRS is a content-valid and reliable PRO measure that can be used in clinical trials. |
Introduction
Prurigo nodularis (PN) is a chronic intensely pruritic inflammatory skin disease characterized by the presence of numerous hyperkeratotic and/or fibrotic nodules, commonly located on the extremities and trunk [1]. When treating PN, the primary goal is to reduce skin inflammation and break the itch–scratch cycle [2]. This goal can be difficult to attain, as there are currently no US Food and Drug Administration (FDA)-approved therapies for PN, and patients are often inadequately managed on off-label therapies, some of which can cause severe side effects [3].
PN has a devastating impact on quality of life [3,4,5]. Compared to people without PN, people with PN are more likely to have comorbidities and use more healthcare resources [6,7,8]. They also have higher rates of anxiety and depression [9] and often suffer from disrupted sleep [10,11,12]. However, PN remains understudied compared to other inflammatory skin diseases, and there is an unmet need for validated patient-reported outcome (PRO) measures to assess the impact of the disease on quality of life and to demonstrate the benefits of novel therapeutics from the patient’s perspective [13].
The peak pruritus numerical rating scale (PP NRS) is a single-item PRO measure for assessing the maximum severity of itch in people with pruritic skin disorders. It has been validated in atopic dermatitis (AD) [14] and plaque psoriasis [15], but not in PN. To determine whether the PP NRS is suitable for use in clinical trials of treatments for improving itch in PN, we evaluated its content validity through qualitative interviews and assessed its psychometric properties using data from a recent phase 2 trial of the humanized anti-human IL-31RA monoclonal antibody nemolizumab in people with moderate-to-severe PN [16]. We also derived a threshold for meaningful within-patient change.
Methods
The human studies reported here were performed in accordance with the Helsinki Declaration of 1964 and its later amendments. Institutional review board/ethics committee approval was obtained (Advarra Inc, protocol #PRE00528446), and all participants provided written informed consent to participate.
Qualitative Interviews
The PP NRS assesses the intensity of itch “at the worst moment during the previous 24 h” on a scale of 0 (“no itch”) to 10 (“worst itch imaginable”). Content validity of the PP NRS was evaluated through qualitative telephone interviews with US adults with PN who scored the PP NRS as ≥ 7 points at screening. Interview participants were recruited via patient associations, patient panels, and social media by convenience sampling. Interviews were conducted in English by researchers experienced in qualitative research (CS, LS, RC, RW; one male, three female) using a semi-structured interview guide. The interviews combined concept elicitation with cognitive debriefing of the PP NRS. Participants were asked which symptoms of PN they suffered from and which of these they considered to be the worst. They then completed a paper copy of the PP NRS and answered questions relating to ease of use and understanding. They also answered questions about the PP NRS score change that would reflect a meaningful improvement, starting from the PP NRS score they recorded during the interview.
The interviews were audio-recorded and transcribed. After removal of any personally identifiable information, the interview transcripts were uploaded, coded, and analyzed by content analysis using ATLAS.ti (ATLAS.ti Scientific Software Development, Berlin, Germany). Quantitative data (including PP NRS responses) was collected on paper forms, which were entered into DataFax (DF/Net Research, Seattle, WA) to convert the data into an electronic format. The data was then analyzed using SAS® version 9.4 (SAS Institute, Cary, NC). The protocol for the interviews was approved by an institutional review board (Advarra, Columbia, MD).
Psychometric Evaluation of the PP NRS
Study Design
Psychometric properties of the PP NRS were analyzed using data from a phase 2 multicenter, randomized, placebo-controlled, double-blinded trial of nemolizumab in adults with severe pruritus (NCT03181503) [16]. Patients received subcutaneous injections of nemolizumab 0.5 mg/kg or placebo at day 1 (baseline), week 4, and week 8. PRO assessments were performed at day 1 and at weeks 4, 8, 12, and 18. The trial protocol was approved by the ethics committee at each participating institution.
Assessments
Patients completed the PP NRS daily in the morning throughout the trial. Other PRO assessments included the Average Pruritus (AP) NRS, AP Verbal Rating Scale (VRS), and PP VRS [17]; Dynamic Pruritus Score (DPS) [18]; and Dermatology Life Quality Index (DLQI) [19] (Table S2 in the supplementary material). Clinician-reported outcomes included the Investigator Global Assessment (IGA) and Prurigo Activity Score (PAS) [20], now known as the Prurigo Activity and Severity Score [21].
Psychometric Analyses
Test–retest reliability was examined using data for “stable” patients, defined on the basis of no score change for the IGA or for the PAS item on number of prurigo lesions in a representative area (hereafter “number of lesions”) between baseline and week 4 or a score change of ≤ 1 point on the AP VRS or PP VRS. Data were analyzed by paired t test, and intraclass correlation coefficients (ICCs) were estimated using a two-way mixed-effects analysis of variance (ANOVA) model with absolute agreement [22, 23]. Agreement base on ICC values was categorized as < 0 = none, 0–0.20 = slight, 0.21–0.40 = fair, 0.41–0.60 = moderate, 0.61–0.80 = substantial, > 0.8 = near-perfect [24].
Convergent and divergent validity was assessed by calculating Spearman’s rank-order correlation coefficients between PP NRS average weekly score and PP VRS, AP VRS, and AP NRS average weekly scores; DLQI total score; DLQI item 1 (itch) score; IGA score; and scores for the PAS items on number of lesions, percentage of prurigo lesions with excoriations/crusts (hereafter “lesions with excoriations/crusts”), and percentage of healed prurigo lesions (hereafter “healed lesions”). Spearman’s rank-order correlation coefficients were calculated for IGA and PAS number of lesions scores to assess divergent validity, and for PP VRS, AP VRS, AP NRS, and DLQI scores to assess convergent validity. Correlations (r) were categorized as < 0.30 = weak, ≥ 0.30 to < 0.50 = moderate, and ≥ 0.50 = strong [25]. It was hypothesized that PP NRS score would have moderate to strong correlations with the PROs (which measure similar constructs) and weak correlations with the clinician-reported outcomes (which measure more distal constructs).
Known-groups validity was assessed by comparing baseline PP NRS scores between subgroups of patients categorized according to PP VRS, AP VRS, and AP NRS average weekly scores, as well as DLQI total score, DLQI item 1 score, IGA score, and PAS item scores. Subgroups were compared by ANOVA adjusted for multiple comparisons by the Scheffé method [26].
To analyze responsiveness, we first calculated Spearman’s rank-order correlation coefficients between change from baseline to week 4 in PP NRS average weekly score and changes from baseline to week 4 for other outcome measures. Outcome measures for which r was ≥ 0.3 were included as anchors in the responsiveness analysis. For each anchor, ANOVA was used to compare changes in PP NRS average weekly scores from baseline to week 4 between patients whose score on the anchor was improved and those whose score was unchanged or worsened. Effect sizes and standardized response means were calculated.
Statistical Procedures
Analyses were conducted with SAS version 9.4 using data from all randomized patients who contributed baseline data for the PP NRS. All statistical tests were at a two-sided significance level of 0.05. Missing data were not imputed.
Meaningful Within-Patient Change
As recommended by the FDA [27], anchor-based methods were used in the primary quantitative analysis to estimate meaningful within-patient change thresholds for the PP NRS. Estimates were derived on the basis of the mean change in PP NRS score from baseline to week 4 in subgroups of patients who experienced meaningful change on other outcome measures. The anchor-based analyses were supplemented using distribution-based methods to calculate 0.5 × standard deviation (SD) and standard error of measurement (SEM) (SD × √ [1 − ICC], where the ICC was calculated using data for patients with no change in PP VRS score between baseline and week 4). Anchor- and distribution-based estimates were triangulated with meaningful improvement data from the qualitative interviews to derive a threshold for meaningful within-patient change.
Results
Qualitative Interviews
Participants
The 21 interview participants had a mean age of 53.1 years (range 27–76) and 15 (71%) were female. Eighteen participants (86%) were White, two (10%) were Black, and one (5%) had mixed Black/Asian heritage. Only one participant was newly diagnosed with PN (within the previous 6 months). Most participants rated the severity of their PN at the time of the interview as either moderate (n = 7/21, 33%) or severe (n = 10/21, 48%).
Concept Elicitation
Itch, pain, bleeding/scabbing, and dry skin were reported by all participants. Other frequently reported symptoms were lumps/bumps (20 participants, 95%), having a crust on the skin (20 participants), burning (19 participants, 90%), stinging (19 participants), lesions/sores (18 participants, 86%), and skin discoloration (18 participants). Of the 18 participants who indicated what their worst symptoms were, 15 (83%) mentioned itch as their worst symptom or one of their worst symptoms. Many participants described their itch using terms such as “painful”, “excruciating”, and “sore”. Others described it as “uncontrollable”, “inescapable”, “intense”, or “extreme”. A few described it as “uncomfortable” or “annoying”.
Of the 15 participants who described the duration of their itch, 8 (53%) reported that it was “constant”, “present all the time,” or lasted “all day”. Three participants (20%) reported that the duration varied or was inconsistent or unpredictable. Fourteen of the 17 participants who discussed the frequency of their itch (82%) stated that they experienced it every day. The other three participants reported having itch “4 days per week”, “several times per week”, and “2 to 3 days per month”. The most frequently reported locations of itch were the arms (8 of 13 participants who reported on location, 62%) and legs (5 participants, 38%). Three participants (23%) described experiencing itch all over their bodies.
Cognitive Debriefing of the PP NRS
When asked to score the PP NRS during the interview, 7 participants (33%) recorded a score of 10, 3 participants (14%) a score of 9, 7 participants a score of 8, 3 participants a score of 7, and 1 participant (5%) a score of 6. The mean (SD) score was 8.6 (1.2).
Most participants (n = 20/21, 95%) demonstrated that they understood the intended meaning of the PP NRS and had no issues with comprehension. All 14 participants who responded when probed further on their understanding of “at the worst moment” demonstrated understanding of this part of the question.
Eighteen of the 19 participants who provided feedback on the clarity of the PP NRS (95%) reported that it was clear and indicated that there was nothing that was difficult to understand or respond to. One participant (5%) expressed that it would be difficult to “describe something that would be in the middle of the scale.” All 16 participants who responded to probing on the “no itch” anchor and all 13 who responded to probing on the “worst itch imaginable” anchor were able to describe the anchor as intended and had no issues with the meaning.
Eighteen of the 20 participants who described their understanding of the 24-h recall period (90%) described it as the past day or as the period from the same time the previous day. Two participants misunderstood the recall period as “Just a few hours ago, when I was having an episode” or “when […] my family were yelling at me for touching my skin”.
Nineteen participants (90%) reported having no issue with selecting a response. Some participants explained that it was “easy” and that there was nothing confusing about the scale. However, two participants (10%) described having an issue when selecting a response. One stated that “it’s so hard to put a number on the itch” and the other explained how distractions during the day meant that they could “more easily give […] a range than […] a certain number”.
Psychometric Evaluation
Patients
The present analysis included 67 of 70 patients randomized in the phase 2 trial (Table 1). Mean (SD) age was 55.7 (16.0) years. Sixty percent of patients were female and most (97%) were White. Most patients (58%) had fewer than 100 nodules on their body. IGA score at baseline was 3 (moderate) for 52% of patients and 4 (severe) for 48%. The mean (SD) number of prurigo lesions in a representative area was 20.2 (16.0). Three-quarters of patients had more than 50% prurigo lesions with excoriations/crusts, and more than half of patients had less than 25% healed lesions.
Psychometric Properties
Test–retest reliability of the PP NRS was substantial based on PROs and weak to fair based on clinician-reported outcomes: ICC values for the correlations between baseline and week 4 scores were 0.73 for PP VRS average weekly score, 0.76 for AP VRS average weekly score, 0.20 for PAS number of lesions, and 0.25 for IGA score (Table 2).
The PP NRS demonstrated convergent and divergent validity. At baseline, PP NRS average weekly score was strongly correlated with PP VRS average weekly score (r = 0.75), AP VRS average weekly score (r = 0.73), DLQI item 1 score (r = 0.54), and AP NRS average weekly score (r = 0.85); moderately correlated with DLQI total score (r = 0.36) and IGA score (r = 0.33); and weakly correlated with PAS item scores (r = − 0.09–0.05) (Table 3).
PP NRS average weekly scores were higher in subgroups of patients with worse itch based on PP VRS, AP VRS, and AP NRS scores than in subgroups with less severe itch (p < 0.0001) (Table 4). Known-groups validity was also demonstrated on the basis of DLQI total score (p = 0.0054), DLQI item 1 score (p < 0.0001), and IGA score (p = 0.0027), but not based on PAS item scores.
A preliminary analysis was conducted to identify outcome measures to include in the responsiveness analysis (those for which r was ≥ 0.3). Change in PP NRS average weekly score from baseline to week 4 was strongly correlated with changes in PP VRS average weekly score (r = 0.88), AP VRS average weekly score (r = 0.84), DLQI item 1 score (r = 0.69), IGA score (r = 0.55), and DPS (r = − 0.65) (Table S1 in the supplementary material). The correlation with PAS number of lesions was weak (r = 0.29). PAS number of lesions was therefore excluded from the responsiveness analysis.
The PP NRS demonstrated responsiveness based on changes in other outcome measures with correlation coefficients ≥ 0.3 (Table 5). Decreases in mean PP NRS weekly average score were significantly higher in patients classified as “improved” (− 3.8 to − 4.8) than in those classified as “worsened/no change” (− 0.2 to − 2.1; p = 0.0010 for PAS lesions with excoriations/crusts, p < 0.0001 for all other outcome measures). Effect sizes were also higher for “improved” subgroups (− 3.7 to − 4.7) than for “worsened/no change” ones (− 0.2 to − 2.0).
Meaningful Within-Patient Change
Qualitative Approach
Participants were asked to consider the PP NRS score change they would view as meaningful, starting from the score they registered during the interview. Of the 18 participants who reported on the smallest score change they would be satisfied or content with when receiving a new treatment, one (6%) indicated that they would be satisfied or content with a 1-point improvement, two (11%) a 2-point improvement, two (11%) a 3-point improvement, and 13 (72%) a more than 3-point improvement (Fig. S1 in the supplementary material). The average smallest score improvement they would be satisfied or content with when receiving a new treatment was 4.9 points.
Twenty of the 21 participants were further probed on meaningfulness of 1-, 2-, and 3-point score changes. Three-quarters of participants (n = 15/20, 75%) indicated that a 3-point improvement would be meaningful. They described that a 3-point improvement would mean “less itch on [their] skin”, “fewer lesions”, or “no burning and no itching”. Most participants (n = 14/20, 70%) expressed that a 2-point improvement would be meaningful. They explained that this would mean “less itch”, “itch severity and nodules reduced”, “less discomfort”, “feeling more comfortable to go out with friends”, or “be[ing] happy”. Only a minority of participants (n = 8/21, 38%) indicated that a 1-point improvement would be meaningful. Participants expressed that a 1-point improvement would mean “feeling more comfortable” or experiencing more calmness”, or that it would be “encouraging knowing that your itch is getting milder”.
Quantitative Approach
In anchor-based analyses, meaningful within-patient change for PP NRS score was estimated as − 2.2 based on the PP VRS, − 2.4 based on the AP VRS, − 4.6 based on the IGA, − 2.0 based on the DPS, and − 1.9 based on the DLQI (Table 6). Distribution-based estimates for the PP NRS were 0.49 for 0.5 × SD and 0.52 for SEM.
Triangulation of Estimates
Results of the anchor-based, distribution-based, and qualitative analyses were triangulated to obtain a range of thresholds for meaningful within-patient change on the PP NRS. The results of this triangulation suggested that a 2- to 5-point decrease in PP NRS score is a meaningful improvement for the target population.
Discussion
PN is characterized by an intractable itch–scratch cycle that can ruin patients’ lives [2, 12, 28]. Validated PRO measures for assessing itch [11, 12] are needed to properly evaluate the patient benefit of new therapeutic options. Here, we conducted a qualitative and psychometric evaluation of the PP NRS as part of a stepwise strategy for establishing its suitability and fitness for purpose for clinical trials in PN.
Participants in the qualitative interviews experienced myriad skin symptoms, the worst of which was itching. They demonstrated a good understanding of the PP NRS, including the recall period, anchors, and response scale. Moreover, most participants expressed that the PP NRS was clear and that it was straightforward to select a response.
The qualitative study found that a 3-point reduction in PP NRS score would be a meaningful improvement for most participants. However, the results also suggested that a 3-point improvement would not necessarily be a change that patients would be satisfied or content with. Indeed, 72% of participants would expect a change of more than 3 points on the PP NRS with a new treatment.
The PP NRS had good test–retest reliability, with ICC values for the PP VRS and AP VRS above the standard threshold of 0.70 [29]. Reliability based on PAS number of lesions and IGA scores was weaker, although this was not surprising, because the aspects of PN these clinician-reported outcomes capture are more closely related to disease activity than to itch. The PP NRS also had good construct validity, and known-groups validity was demonstrated on the basis of PROs and some but not all clinician-reported outcomes. For each of the anchors included in the responsiveness analysis, the PP NRS was able to differentiate between patients whose score on the anchor improved and those whose score was unchanged or worsened.
The identified 2- to 5-point meaningful within-patient change threshold is aligned with the threshold of a 4-point decrease in PP NRS score derived from anchor-based analyses in adults with moderate-to-severe AD [14] and plaque psoriasis [15]. The 4-point threshold has been accepted by European and US regulators in other skin conditions as a meaningful within-patient change threshold to demonstrate treatment effect [30,31,32,33].
Until recently, validated PRO measures for assessing itch in PN were not available. However, the worst itch (WI) NRS [34] has now been validated using data from a placebo-controlled trial of the small-molecule inhibitor serlopitant in patients with treatment-refractory PN [35]. The PP NRS and WI NRS have similar response scales and recall periods and appear to be conceptually equivalent based on cognitive debriefing and usability testing in US adults with moderate-to-severe PN [36]. However, they differ in their wordings: the PP NRS uses itch “at the worst moment” and the WI NRS uses itch “at its most intense”. For economic modelling and comparative effectiveness analyses, an equivalence study would be useful to provide evidence that the two itch NRSs measure the same concept and provide comparable data.
Limitations of the present study include the small sample. Also, the 4-week time interval in the test–retest analysis is longer than the typical 1 to 2 weeks between assessments [37], although the effect of this difference is likely conservative. Finally, the lack of analyses by sex potentially limits the generalizability of our findings, although the female predominance in our study samples is aligned with the higher prevalence of PN in women compared to men [1, 9].
Conclusion
This study provides strong evidence that the PP NRS is a content-valid and reliable PRO measure that can be used by adults with PN in clinical trials. Moreover, it provides a meaningful within-patient change threshold estimate for the PP NRS, based on triangulation of quantitative estimates and meaningful improvement data derived from a qualitative analysis.
References
Huang AH, Williams KA, Kwatra SG. Prurigo nodularis: epidemiology and clinical features. J Am Acad Dermatol. 2020;83:1559–65.
Kwatra SG. Breaking the itch-scratch cycle in prurigo nodularis. N Engl J Med. 2020;382:757–8.
Williams KA, Huang AH, Belzberg M, Kwatra SG. Prurigo nodularis: pathogenesis and management. J Am Acad Dermatol. 2020;83:1567–75.
Aggarwal P, Choi J, Sutaria N, et al. Clinical characteristics and disease burden in prurigo nodularis. Clin Exp Dermatol. 2021;46:1277–84.
Whang KA, Le TK, Khanna R, et al. Health-related quality of life and economic burden of prurigo nodularis. J Am Acad Dermatol. 2022;86:573–80.
Huang AH, Canner JK, Williams KA, Grossberg AL, Kwatra MM, Kwatra SG. Healthcare resource utilization and payer cost analysis of patients with prurigo nodularis. Br J Dermatol. 2020;183:182–4.
Wongvibulsin S, Sutaria N, Williams KA, et al. A nationwide study of prurigo nodularis: disease burden and healthcare utilization in the United States. J Invest Dermatol. 2021;141(2530–3):e1.
Morgan CL, Thomas M, Stander S, et al. Epidemiology of prurigo nodularis in England: a retrospective database analysis. Br J Dermatol. 2022;187:188–95.
Huang AH, Canner JK, Khanna R, Kang S, Kwatra SG. Real-world prevalence of prurigo nodularis and burden of associated diseases. J Invest Dermatol. 2020;140(480–3):e4.
Gwillim EC, Janmohamed SR, Yousaf M, Patel KR, Silverberg JI. The impact of prurigo nodularis on sleep disturbance and related impact: a systematic review. J Eur Acad Dermatol Venereol. 2020;34:e815–7.
Todberg T, Zachariae C, Skov L. Treatment and burden of disease in a cohort of patients with prurigo nodularis: a survey-based study. Acta Derm Venereol. 2020;100:adv00119.
Pereira MP, Hoffmann V, Weisshaar E, et al. Chronic nodular prurigo: clinical profile and burden. A European cross-sectional study. J Eur Acad Dermatol Venereol. 2020;34:2373–83.
Chanis R, Dias Barbosa C, Wilson R, Jabbar-Lopez Z, Puelles J, Gabriel S. Exploring symptoms, impacts, and patient-reported outcome (PRO) measures in prurigo nodularis: a review of the literature. Virtual ISPOR Europe 2020; 16–19 November, 2020
Yosipovitch G, Reaney M, Mastey V, et al. Peak Pruritus Numerical Rating Scale: psychometric validation and responder definition for assessing itch in moderate-to-severe atopic dermatitis. Br J Dermatol. 2019;181:761–9.
Kimball AB, Naegeli AN, Edson-Heredia E, et al. Psychometric properties of the Itch Numeric Rating Scale in patients with moderate-to-severe plaque psoriasis. Br J Dermatol. 2016;175:157–62.
Ständer S, Yosipovitch G, Legat FJ, et al. Trial of nemolizumab in moderate-to-severe prurigo nodularis. N Engl J Med. 2020;382:706–16.
Phan NQ, Blome C, Fritz F, et al. Assessment of pruritus intensity: prospective study on validity and reliability of the visual analogue scale, numerical rating scale and verbal rating scale in 471 patients with chronic pruritus. Acta Derm Venereol. 2012;92:502–7.
Ständer S, Blome C, Anastasiadou Z, et al. Dynamic pruritus score: evaluation of the validity and reliability of a new instrument to assess the course of pruritus. Acta Derm Venereol. 2017;97:230–4.
Finlay AY, Khan GK. Dermatology Life Quality Index (DLQI)–a simple practical measure for routine clinical use. Clin Exp Dermatol. 1994;19:210–6.
Pölking J, Zeidler C, Schedel F, et al. Prurigo Activity Score (PAS): validity and reliability of a new instrument to monitor chronic prurigo. J Eur Acad Dermatol Venereol. 2018;32:1754–60.
KCP. Prurigo Activity and Severity Score (PAS). Version 1.2 02.03.2021. Center for Chronic Pruritus, University Hospital Münster; 2021.
Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psychol Bull. 1979;86:420–8.
Koo TK, Li MY. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. 2016;15:155–63.
Landis JR, Koch GG. The measurement of observer agreement for categorical data. Biometrics. 1977;33:159–74.
Cohen J. Statistical power analysis for the behavioral sciences. 2nd ed. Hillsdale: Lawrence Erlbaum; 1988.
Scheffé H. The analysis of variance. New York: Wiley; 1999.
FDA. Guidance for industry patient-reported outcome measures: use in medical product development to support labeling claims. 2009. https://www.fda.gov/media/77832/download. Accessed 3 Mar 2022.
Kwon CD, Khanna R, Williams KA, Kwatra MM, Kwatra SG. Diagnostic workup and evaluation of patients with prurigo nodularis. Medicines (Basel). 2019;6:97.
Nunnally JC. The assessment of reliability. In: Nunnally JC, Bernstein IH, editors. Psychometric theory. New York: McGraw Hill; 1994. p. 248–92.
Simpson EL, Ständer S, Yosipovitch G, et al. Peak pruritus numeric rating scale (PP-NRS) response with abrocitinib in patients with moderate-to-severe atopic dermatitis (AD): results from a randomized, phase 3 clinical trial. 2020 San Diego Dermatology Symposium, September 11–13, 2020.
Eichenfield LF, Flohr C, Sidbury R, et al. Efficacy and safety of abrocitinib in combination with topical therapy in adolescents with moderate-to-severe atopic dermatitis: the JADE TEEN randomized clinical trial. JAMA Dermatol. 2021;157:1165–73.
FDA. DUPIXENT® (dupilumab) injection, for subcutaneous use. Full Prescribing Information. Regeneron Pharmaceuticals, Inc./Sanofi-Aventis LLC 2021. https://www.accessdata.fda.gov/drugsatfda_docs/label/2021/761055s016lbl.pdf. Accessed Mar 3, 2022.
EMA. Dupixent: EPAR - Product Information. Annex I - Summary of Product Characteristics. Sanofi-Aventis Groupe S.A. 2022. https://www.ema.europa.eu/en/documents/product-information/dupixent-epar-product-information_en.pdf. Accessed Feb 3, 2022.
Naegeli AN, Flood E, Tucker J, Devlen J, Edson-Heredia E. The Worst Itch Numeric Rating Scale for patients with moderate to severe plaque psoriasis or psoriatic arthritis. Int J Dermatol. 2015;54:715–22.
Kimel M, Zeidler C, Kwon P, Revicki D, Stander S. Validation of psychometric properties of the itch numeric rating scale for pruritus associated with prurigo nodularis: a secondary analysis of a randomized clinical trial. JAMA Dermatol. 2020;156:1354–8.
Puelles J, de la Cruz M, Murphy R, Devlen J, Gabriel S. Qualitative equivalence of the worst itch and peak pruritus numerical rating scales in prurigo nodularis [poster]. European Academy of Dermatology and Venereology, EADV 35th Congress, 29 Sept to 02 Oct 2021. 2021.
Polit DF. Getting serious about test-retest reliability: a critique of retest research and some recommendations. Qual Life Res. 2014;23:1713–20.
Acknowledgements
The authors thank the interview and clinical trial participants who contributed to this study.
Author Contributions
Conception of the study: Danielle Rodriguez, Carla Dias-Barbosa, Ismail Budhiarso, Fatoumata Fofana, Sylvie Gabriel, Christophe Piketty; Design of the study: Shawn G. Kwatra, Danielle Rodriguez, Carla Dias-Barbosa, Ismail Budhiarso, Fatoumata Fofana, Margaret Vernon, Jorge Puelles; Analysis of the data: Danielle Rodriguez, Carla Dias-Barbosa, Ismail Budhiarso, Fatoumata Fofana; Writing and review of this article: all authors.
Funding
This study was funded by Galderma, which is currently sponsoring the development of nemolizumab for the treatment of prurigo nodularis. Galderma also funded the journal’s Rapid Service Fees.
Medical Writing, Editorial, and Other Assistance
The authors thank Rebecca Chanis, Charlie Smith, Laura Sweat and Rozanne Wilson (all formerly of Evidera) for conducting the qualitative interviews. The authors also thank Stephen Gilliver, PhD and Phillip Leventhal, PhD of Evidera for providing medical writing support, which was funded by Galderma in accordance with Good Publication Practice guidelines (https://www.ismpp.org/gpp-2022).
Data Availability
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
Ethical Approval
The human studies reported here were performed in accordance with the Helsinki Declaration of 1964 and its later amendments. Institutional review board/ethics committee approval was obtained (Advarra Inc., protocol #PRE00528446), and all participants provided written informed consent to participate.
Conflict of Interest
Shawn G. Kwatra has received grants from Galderma, Incyte, Pfizer, and Sanofi, and consulting fees from AbbVie, Arcutis Biotherapeutics, Castle Biosciences, Celldex Therapeutics, Galderma, Incyte, Johnson & Johnson, Leo Pharma, Novartis, Pfizer, Regeneron Pharmaceuticals, and Sanofi. He has served on the Board of Directors of the Skin of Color Society. He was paid by Galderma for work relating to this study. Danielle Rodriguez, Carla Dias-Barbosa, Ismail Budhiarso, Fatoumata Fofana, and Margaret Vernon are employees of Evidera, which was paid by Galderma for work relating to this study. Sylvie Gabriel, Christophe Piketty, and Jorge Puelles are employees of Galderma, which funded this study and is currently sponsoring the development of nemolizumab for the treatment of prurigo nodularis.
Author information
Authors and Affiliations
Corresponding author
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License, which permits any non-commercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc/4.0/.
About this article
Cite this article
Kwatra, S.G., Rodriguez, D., Dias-Barbosa, C. et al. Validation of the Peak Pruritus Numerical Rating Scale as a Patient-Reported Outcome Measure in Prurigo Nodularis. Dermatol Ther (Heidelb) 13, 2403–2416 (2023). https://doi.org/10.1007/s13555-023-00999-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13555-023-00999-9