Open-label placebos—a systematic review and meta-analysis of experimental studies with non-clinical samples

Spille, Lukas; Fendel, Johannes C.; Seuling, Patrik D.; Göritz, Anja S.; Schmidt, Stefan

doi:10.1038/s41598-023-30362-z

Open-label placebos—a systematic review and meta-analysis of experimental studies with non-clinical samples

Article
Open access
Published: 04 March 2023

Volume 13, article number 3640, (2023)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Open-label placebos—a systematic review and meta-analysis of experimental studies with non-clinical samples

Download PDF

Lukas Spille¹,
Johannes C. Fendel²,
Patrik D. Seuling¹,
Anja S. Göritz¹ &
…
Stefan Schmidt²

4377 Accesses
11 Citations
19 Altmetric
2 Mentions
Explore all metrics

Abstract

The use of open-label placebos (OLPs) has shown to be effective in clinical trials. We conducted a systematic review and meta-analysis to examine whether OLPs are effective in experimental studies with non-clinical populations. We searched five databases on April 15, 2021. We conducted separate analyses for self-reported and objective outcomes and examined whether the level of suggestiveness of the instructions influenced the efficacy of OLPs. Of the 3573 identified records, 20 studies comprising 1201 participants were included, of which 17 studies were eligible for meta-analysis. The studies investigated the effect of OLPs on well-being, pain, stress, arousal, wound healing, sadness, itchiness, test anxiety, and physiological recovery. We found a significant effect of OLPs for self-reported outcomes (k = 13; standardized mean difference (SMD) = 0.43; 95% CI = 0.28, 0.58; I² = 7.2%), but not for objective outcomes (k = 8; SMD = − 0.02; 95% CI = − 0.25, 0.21; I² = 43.6%). The level of suggestiveness of the instructions influenced the efficacy of OLPs for objective outcomes (p = 0.02), but not for self-reported outcomes. The risk of bias was moderate for most studies, and the overall quality of the evidence was rated low to very low. In conclusion, OLPs appear to be effective when examined in experimental studies. However, further research is needed to better understand the mechanisms underlying OLPs.

Effects of open-label placebos in clinical trials: a systematic review and meta-analysis

Article Open access 16 February 2021

The Nocebo Effect: A Review of Contemporary Experimental Research

Article 17 August 2021

Placebo: a brief updated review

Article 09 August 2022

Introduction

The term ‘placebo’ is commonly defined as an inert substance or procedure^1,2. Placebos come in many forms such as sugar pills, saline injections, sham surgery, or verbal interventions. The ‘placebo response’ is best described as a biopsychosocial phenomenon that occurs subsequent to the application of a placebo^1,3. Because changes in the placebo group may also be due to confounding factors such as regression to the mean or the natural tendency of the disease, comparison with a no treatment condition is necessary to detect and quantify the ‘placebo effect’^1,4. In research, placebos are often applied as a control condition to determine a true intervention effect that is separate from influences that may be attributed to the psychosocial context^5,6,7. The finding that people’s conditions often improve in placebo groups has given rise to a new research domain, focusing entirely on the placebo¹. Aside from their scientific use, placebos are often administered in practical medicine to treat and/or appease patients⁸.

Placebos are mostly administered by physicians in a hidden way^1,9. Concealment was long assumed to be an elementary component for the occurrence of the placebo effect⁵. However, this approach has been criticized for withholding the truth from participants; thus violating both the concept of informed consent and autonomy^10,11. A potential solution to this problem is the use of open-label placebos (OLPs). OLPs are placebos that are administered openly without deception (i.e., subjects are told that they will receive a placebo). Research on OLPs dates back as early as 1965, when Park and Covi conducted a groundbreaking study on psychiatric outpatients¹², who received placebos and were clearly informed that they were receiving only sugar pills with no active ingredient. The patients in this uncontrolled study improved significantly in both physician ratings and patient ratings. However, the topic had not been taken up for 45 years when in 2010, Kaptchuk et al. published a study on the use of OLPs in irritable bowel syndrome that attracted attention¹³. This sparked interest in OLPs and led to an increase in research activities. A recent meta-analysis by our research group has shown that OLPs are effective when used in clinical trials¹⁴. The authors included studies examining OLPs for a range of conditions including ADHD, depression, chronic lower back pain, irritable bowel syndrome, allergic rhinitis, cancer-related fatigue, and menopausal hot flushes, and found an overall medium-sized effect (SMD = 0.72) of OLPs compared to no treatment. All outcomes used for analysis were based on self-report.

In addition to clinical trials, it is also important to investigate the effects of OLPs in experimental studies in non-clinical, healthy populations. The efficacy of OLPs in non-clinical populations may differ substantially from that found in clinical trials, as there are specific factors that may contribute to the placebo response only in clinical trials. For example, patients suffering from chronic or refractory diseases may be more likely than healthy peers to place hope in novel interventions such as OLPs¹⁵. Investigating the extent to which OLPs work in healthy participants will help to understand the conditions and mechanisms underlying the effects of OLPs without the confounding influence of patient hope. Finally, this review may contribute to the consideration of OLPs as an alternative to the continued use of deceptive placebos in clinical as well as experimental research and to overcome the ethical concerns associated with them. However, a systematic summary of experimental studies of OLPs with healthy samples is yet to be published.

The aim of this study is to conduct a systematic review and meta-analysis to determine the efficacy of OLPs in experimental studies with non-clinical, healthy populations. Therein, OLPs are compared either to no treatment (NT) or to covert placebo (CP). Covert placebo refers here to a control condition in which the same physical treatment is given as in the OLP condition. However, the covert placebo is administered with an explanation that distracts participants’ attention from the dependent variable, for example, by giving a technical reason for the placebo’s use. This explanation thus acts as a ‘cover story’ for the intended effect on the dependent variable. We hypothesize that (1) OLPs are more effective as compared to NT or CP. Moreover, research has shown that OLPs are more effective the more suggestive the instructions that accompany the placebo’s administration^13,16,17. Therefore, we hypothesize that (2) higher levels of suggestiveness are associated with greater OLP efficacy.

Results

Study selection

Our search of five electronic databases on April 15, 2021 (see “Methods” section below) yielded 3573 records (Fig. 1). After removing duplicates, the remaining 2352 titles and abstracts were screened, and 53 full texts were assessed for eligibility. We included 18 eligible articles comprising 20 studies (1201 participants) in the systematic review. Two articles reported on two independent experiments, each experiment involving a different sample^18,19. We included these experiments as individual studies in the analyses. An overview of the studies that we excluded after the full-text screening is in Table S1 (Supplementary Material). Three studies were excluded from the meta-analyses on account of their within-subject design^20,21,22, leaving a total of 17 studies.

Characteristics of studies, participants, and interventions

All of included studies were Randomized Controlled Trials (RCTs), of which three were crossover studies^20,21,22 and 17 were parallel-group studies^{16,18,19,23,24,25,26,27,28,29,30,31,32,33,34}. In two parallel-group studies, a balanced placebo design was used^33,34. Three studies used a CP control condition^18,28, and all other studies used a NT control condition. Five studies included more than one OLP condition^16,19,23,25. All studies were reported in English and published between 2001 and 2021. Nine studies were conducted in Germany^{19,24,25,29,30,31,32,34}, four in the United States^18,20,28, two in Switzerland^16,22, two in Australia^23,33, and one each in New Zealand²⁶, the Netherlands²⁷, and Brazil²¹.

Overall, 585 participants received an OLP intervention while 535 participants served as controls. In addition, 81 participants took part in crossover studies and therefore underwent both conditions. The sample sizes of individual studies ranged from 21 to 199.

Five studies investigated the effect of an OLP on pain^{16,22,25,28,29}. Three studies examined the influence of OLPs on well-being^19,23, three on stress^18,31, and two on arousal^32,34. OLPs have also been studied in relation to wound healing²⁶, sadness²⁴, itchiness²⁷, test anxiety³⁰, cycling performance²¹, physiological recovery³³, and muscle strength and fatigue²⁰. A wide range of placebos was used in the studies, including a nasal spray^18,24,29, pills^23,26,30,31, creams^16,25,28, capsules^20,21, bottled water¹⁹, decaffeinated coffee^32,34, verbal suggestions²⁷, acupuncture³³, and intravenous injection²². The degree of instructional suggestiveness was similarly distributed across studies, with five studies receiving a rating of 0 regarding the degree of instructional suggestiveness^{24,29,32,33,34}, six studies receiving a rating of 1^18,19,20,27, four studies receiving a rating of 2^21,25,26,28, and five studies receiving a rating of 3^{16,22,23,30,31}. Study characteristics are in Table 1. Detailed instructions accompanying the administration of the OLPs are in Table S2.

Table 1 Characteristics of included studies.

Full size table

Risk of bias within studies

There was little variation among studies in risk of bias (see Figure S1). Seventeen studies (85%) were rated as having “some concerns”^{16,18,19,20,21,22,23,24,25,27,28,29,30,31,32,33}, two studies (10%) as “low risk of bias”,^26,34 and one study (5%) as “high risk of bias”¹⁸. The randomization process was not adequately described in some studies^{18,19,20,21,22,29}. Others presented biases arising from the selection of the reported results^{16,18,19,20,21,22,24,25,27,28,29,30,31,32,33} or unblinded outcome assessors^{18,19,22,23,24,25,28,30,31}.

Data synthesis and analyses

Thirteen studies (772 participants, 401 in the experimental group and 371 controls) were included in the meta-analysis on self-reported outcomes: Analysis revealed a significant positive effect of OLPs compared to NT or CP (SMD = 0.43; 95% Cl = 0.28, 0.58; p < 0.01; Fig. 2). Heterogeneity was low and non-significant: χ² (12) = 12.94, p = 0.37, I² = 7.2%.

Eight studies (583 participants, 302 in the experimental group and 281 controls) were included in the meta-analysis on objective outcomes: We did not find a significant effect of OLP compared to NT or CP (SMD = − 0.02; 95% Cl = − 0.25, 0.21; p = 0.87; Fig. 3). Heterogeneity was low and non-significant: χ² (7) = 12.40, p = 0.09, I² = 43.6%. However, the results of the heterogeneity tests should be considered with caution due to low power³⁵.

There were five studies with more than one OLP group^16,19,23,25, necessitating the selection of data for the analyses. With one study²³ we chose to combine the data from both OLP groups, as they differed only in the number of placebos given to the participants. With the remaining four studies that had more than one OLP group^16,19,25, we followed our approach of selecting whichever group had the most suggestive instructions (i.e., those that included the most statements from Kaptchuk et al.¹³). Specifically, with one study that had more than one OLP group¹⁶, we used the data from the group receiving an open-label placebo with rationale (OPR+) instead of the group receiving open-label placebo without rationale (OPR −). With another study that had more than one OLP group²⁵, we used the data from the group in which expectancies were evoked (OLP-E) rather than the group in which hope was induced (OLP-H) as instructions in the OLP-E group were more closely aligned with Kaptchuk et al.¹³. Two further studies had three OLP conditions¹⁹. From these studies we used the data from the OPR + group, as the other groups either did not receive a placebo rationale (OPR group) or received an additional treatment in the form of a relaxation and imagination exercise (OPR++ group), which would have limited comparability with the placebo groups in the other included studies.

We obtained change scores for all studies except for one¹⁸, for which only post-intervention scores were reported due to lack of baseline measurements. Another study reported post-intervention values¹⁶, but we considered these as change scores, as they were adjusted for the corresponding baseline scores.

Subgroup analyses

For self-reported outcomes, the subgroups with different levels of suggestiveness did not differ significantly (Q(3) = 1.02; p = 0.80; Table S3). For objective outcomes, the subgroups differed significantly (Q(3) = 9.49; p = 0.02). Specifically, the subgroup representing studies in which only one suggestive statement was communicated showed a significant OLP effect (p = 0.009), while the other subgroups did not (all ps > 0.05). However, only eight studies were included in this analysis, with two subgroups involving only one study each. Therefore, these results should be interpreted with caution.

Exploratory subgroup analyses examining the influence of the type of control condition (CP or NT) on OLP efficacy revealed that subgroups differed significantly for both self-reported (Q(1) = 5.26; p = 0.02; see Table S4) and objective outcomes (Q(1) = 8.14; p < 0.01). In both analyses, the subgroup representing studies with CP controls descriptively yielded a larger effect size than the subgroup of studies with NT controls (see Table S4). However, the number of studies within subgroups was imbalanced. In the analysis of self-reported outcomes, 11 studies were in the subgroup of NT controls and only two studies were in the subgroup of CP controls. As for the analysis of objective outcomes, seven studies were in the subgroup of NT controls and only one was in the subgroup of CP controls. Because of the paucity of studies with CP controls, these results can only provide preliminary evidence and should be interpreted with great caution. Another exploratory analysis examining whether the OLP effect differed between studies involving completely healthy individuals and studies involving individuals with subclinical complaints (e.g., test anxiety) revealed no significant difference (Q(1) = 0.68; p < 0.41; see Table S5). Again, due to the limited number of studies, these results should be interpreted with caution.

Reporting bias

The visual inspection of the funnel plot as well as Egger’s regression test indicated no evidence of publication bias either for self-reported outcomes (intercept = − 1.85; 95% Cl − 5.51, − 1.81; p = 0.34; Fig. 4), or for objective outcomes (intercept = − 2.38; 95% Cl − 4.97, − 0.22; p = 0.12; Fig. 5). For objective outcomes, however, results should be interpreted with caution as Egger's regression test had low power³⁶.

Certainty of evidence

The overall quality of the evidence was assessed with the Grading of Recommendations Assessment, Development and Evaluation (GRADE)³⁷ approach. Using this tool, the overall quality of evidence was rated as low to very low. Specifically, the overall quality of evidence was rated as low for objective pain, self-reported distress, and self-reported well-being. The overall quality of evidence was rated as very low for self-reported pain and the sub-clusters of physiological outcomes. Details on the GRADE ratings are in Table S6.

Discussion

Although OLPs were studied as early as 1965, broader interest in this topic has emerged only recently, after several clinical trials of this particular type of placebo application were published^12,13,14. This provided the impetus for more research on OLPs, both in clinical and in experimental studies. We conducted a systematic review and meta-analysis to investigate whether the OLP effect found in clinical trials holds true in experimental studies with non-clinical, healthy individuals. We included 20 studies, of which 17 were suitable for meta-analyses. Thirteen studies analyzed the OLP effect with self-reported outcomes and eight studies with objective outcomes. The results of the meta-analyses revealed a small to medium OLP effect for self-reported outcomes and no OLP effect for objective outcomes. Subgroup analyses revealed that the level of suggestiveness of the instructions influenced the efficacy of OLPs for objective outcomes, but not for self-reported outcomes. However, due to the small number of studies, these results regarding suggestiveness should be viewed with caution. An exploratory subgroup analysis suggested that the use of CP as a comparator resulted in greater OLP efficacy as compared to NT. However, because of the limited number of studies using CP controls, these results are tentative and should be interpreted with caution.

Our finding of a significant OLP effect for self-reported outcomes in healthy individuals is consistent with a meta-analysis on clinical trials¹⁴. The aggregated SMD of that meta-analysis on clinical trials was larger than what we found in experimental studies with non-clinical, healthy individuals (SMD 0.72 vs. 0.43). There is reason to believe that patients suffering from clinical symptoms or medical conditions show higher placebo susceptibility because of plausible differences in the therapeutic encounter and contextual aspects surrounding the placebo administration in clinical trials compared to experimental ones. In the clinical setting, for example, emphasis is placed on attentive and supportive patient care. This may create a therapeutic bias leading patients to expect improvement³. As mentioned earlier, people seeking treatment often hope for relief¹⁵. Thus, there may be fundamental differences between patients and healthy individuals in the mechanisms that contribute to the placebo effect such as motivation and expectation¹. Consequently, there might also be differences in OLP effect between experimental studies on completely healthy participants versus participants with subclinical complaints. Such subclinical samples are also present in our study, for example, Schaefer et al.³⁰ who investigated the influence of OLPs on test anxiety. However, in the explorative subgroup analysis comparing experimental studies involving completely healthy participants with studies involving individuals with subclinical complaints we could not find any difference in the OLP effect.

Regarding the differences between self-reported and objective outcomes, our finding of a null effect for objective outcomes raises the question of whether OLPs and deceptive placebos have the same pattern of effect, as changes in objective outcomes have been repeatedly demonstrated in studies using deceptive placebos³⁸. One might therefore hypothesize that OLPs, unlike deceptive placebos, do not entail biological changes. However, Kaptchuk & Miller³ emphasize that also deceptive placebos primarily affect self-reported and self-appraised symptoms. Further studies comparing the effects of OLPs with deceptive placebos on objective outcomes are needed to clarify this issue.

The level of suggestiveness of the instructions showed an association with the efficacy of OLPs only for objective outcomes. This association was contrary to our hypothesis. In this analysis the one subgroup that showed a significant OLP effect consisted of only one study (Table S3), which had a sample size three times as large as the average study assessing objective outcomes and therefore had a substantially higher weight in the analysis relative to the others. Thus, this finding should be interpreted very cautiously. Overall, the observed results do not confirm our hypothesis that the OLP effect increases with higher instructional suggestiveness. However, this may change with the future publication of larger OLP studies that would make a reassessment of this issue possible.

This review has several strengths. First, two researchers performed the screening processes and the application of the assessment tools, ensuring high quality and reliability. Second, we used an established search strategy¹⁴, increasing the comparability with other reviews on OLPs. Third, we did not use any date or language restrictions and searched as many as five electronic databases to be comprehensive. Moreover, in both meta-analyses, the heterogeneity was low, and the statistical examination of funnel plot asymmetry revealed no indication of publication bias.

This review also has a number of limitations. First, the number of studies is small, and therefore, the stability of the results might be low. Second, the variability of outcomes in the included studies was large and unbalanced. While most studies were conducted in the context of pain research or well-being, only a few others were conducted in other contexts. This limits the generalizability of the findings and impedes the derivation of specific practical implications for the application of OLPs. The fact that the overall quality of evidence, as assessed by the GRADE approach, was low to very low underscores this point. Third, the variability of investigators of OLPs in experimental studies is limited, and some authors have been involved in multiple publications, highlighting the need for further independent studies and replications. Fourth, to assess instructional suggestiveness we used an ad hoc developed tool based on the four suggestiveness statements by Kaptchuk et al.¹³. We followed this procedure because the instructions of most OLP studies are based on these four statements and because studies in which the instructions deviated slightly from Kaptchuk's framework were easily able to be subjected to this taxonomy. However, in isolated cases such as Meeuwis et al.²⁷, there were instructions that deviated to a greater extent from the statements by Kaptchuk et al.¹³, indicating that an assessment based on our approach might not have captured all aspects of suggestiveness. Fifth, we used the RoB 2 to assess the methodological quality of the included studies. The application of the RoB 2 in the context of OLPs required an adaptation of the tool. Owing to the specifics of non-deceptive placebos, knowledge of the intervention received cannot be separated from the open-label placebo effect¹⁴. Therefore, we chose not to rate the knowledge of the intervention received as risk of bias. Future research would benefit from revised instruments that are adapted to the character of OLP interventions. Finally, one study was rated as having a high risk of bias. This study showed the largest effect size of all studies included in the meta-analysis of self-reported outcomes. Although the chi-squared test for heterogeneity was not significant in the analysis of self-reported outcomes and the I² statistic indicated negligible heterogeneity, this high-risk study may have inflated the assessment of the overall effect due to methodological flaws.

This review adds to the evidence that OLPs offer the possibility of improving subjective symptoms without the need to lie about the placebo or to take active agents. Having said this, the opinions of many physicians towards OLPs differ greatly³⁹. Patients, on the other hand, seem more open to this novel use of placebos. For example, in a study of placebo acceptability in patients with chronic pain, respondents indicated that they preferred open-label placebos to deceptive placebos⁴⁰. However, there are also studies suggesting the opposite, that is, patients preferring deceptive over open-label placebos⁴¹. The patients' desire for transparency and right to informed consent aligns with the calls of leading placebo researchers who oppose the use of deceptive placebos in clinical practice⁴². Based on their confirmed efficacy in initial studies and low side effects, we suggest identifying circumstances in which OLPs might be preferable to deceptive placebos even in clinical settings, given that deceptive placebos might be ethically questionable in some circumstances.

The results of this systematic review and meta-analysis to examine the effect of OLPs in experimental studies with non-clinical, healthy individuals suggest that OLPs are effective for self-reported outcomes, but not for objective outcomes. The degree of instructional suggestiveness seems to influence the efficacy of OLPs only for objective outcomes, but not in the direction it was expected. These findings need to be confirmed in future research based on a larger number of primary studies. This would also allow for an adequate statistical investigation of the influence of different control conditions on the efficacy of OLPs.

Methods

We preregistered this review at the Open Science Framework (OSF) on April 12, 2021 (https://doi.org/10.17605/OSF.IO/4CAFQ). The review adhered to the checklist of the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA)⁴³.

Eligibility criteria

Eligible studies had to meet the following criteria: (1) Population: We included studies with non-clinical populations. Studies with clinical populations were excluded. (2) Intervention: We included OLP interventions regardless of their specific application. (3) Comparison: We considered either a no-treatment control condition (NT) or covert placebo (CP) control condition. (4) Outcome: We included studies that measured the efficacy of OLPs on any given scale. Since the aim of this study was to investigate the effect of OLPs on a meta-level (i.e., across various outcomes), we did not apply restrictions to the types of outcomes. (5) Design: We included RCTs and excluded all other study designs.

Information sources and search strategy

We screened five electronic bibliographic databases comprising all entries from database inception to April 15, 2021. We did not apply any language restrictions. We searched for studies using Medline via PubMed (1965 to April 15, 2021), PsycINFO via EBSCO (1967 to April 15, 2021), PSYNDEX via EBSCO (1977 to April 15, 2021), Web of Science Core Collection (1945 to April 15, 2021), and The Cochrane Central Register of Controlled Trials (CENTRAL, The Cochrane Library, Wiley), Issue 4 of April 12, 2021. Due to its composite nature, CENTRAL does not have an inception date. However, we did not apply any date restrictions and used the latest issue available. In addition, we screened the Journal of interdisciplinary placebo studies DATABASE (JIPS, https://jips.online/).

We used a search strategy similar to von Wernsdorff et al.¹⁴. The search terms served the purpose of describing the OLP intervention in more detail. Therefore, in addition to terms such as "placebo", we used synonyms for "open-label", such as “non blind” or “without deception”. Since the aim of this study was to investigate the effect of OLPs on a general level, we did not specify outcomes and control conditions in the search strings. In addition, we used wildcards and variant forms of spelling to find as many studies as possible. The search strings are in Tables S7, S8, S9, S10, S11. Slight variations between the search strings are due to different proximity operators across the databases. We compiled all records identified in the databases in the reference management software Zotero 5.0.96.2 (Corporation for Digital Scholarship, Vienna, Virginia) and removed duplicates. We conducted both backward and forward citation searches of all included studies and important reviews on OLPs^14,44 using Web of Science and PsychINFO.

Study selection and data extraction

Two researchers (LS and PDS) independently screened titles, abstracts, and full texts for inclusion. Title and abstract screening were carried out using the systematic review software Rayyan (Rayyan QCRI, Doha, Qatar). Disagreements were resolved through discussion. If no consensus could be reached, JCF and SS were consulted. The chance-corrected agreement between raters after the full text screening was substantial (κ = 0.62). In cases where eligible studies did not report the necessary information to compute effect sizes, we contacted the authors of the studies. If the authors did not respond or were unable to provide the data, these studies were excluded.

The same two researchers, who selected the studies, independently extracted the data. Again, disagreements were resolved through discussion. We extracted data on: author, year, country of trial, study design, sample size, control condition, intervention characteristics, the exact wording of instructions given to the participants, as well as the type and number of outcomes into a spreadsheet. For outcomes, we extracted the means, sample sizes, and standard deviations. If reported, this was done for change scores, otherwise for both pre- and post-intervention scores. For studies where only the standard error was reported, we transformed the standard error into the standard deviation according to the procedure outlined in the Cochrane Handbook⁴⁵.

As stated in the preregistration, we extracted the primary outcome as specified in the individual studies. If multiple primary outcomes were specified in the individual studies, we extracted all primary outcomes. If no outcome was designated as primary, we extracted all outcomes. For studies that included multiple control conditions we only extracted data on the OLP condition and the corresponding comparator (i.e., NT or CP).

Study risk of bias assessment

We used the revised Cochrane risk of bias tool for randomized trials (RoB 2) to assess the risk of bias in primary studies. Five domains of bias are assessed using the RoB 2, namely biases arising from (1) the randomization process, (2) deviations from intended interventions, (3) missing outcome data, (4) measurement of the outcome, and (5) selection of the reported result⁴⁶. Ratings for each domain range from “low risk of bias”, to “some concerns”, to “high risk of bias”. Finally, the ratings of the individual domains are aggregated into an overall rating, which in most cases is equivalent to the worst rating in any of the domains⁴⁶.

Given the specific context of OLP, we agree with von Wernsdorff et al.¹⁴ that a lack of blinding of participants should not result in an increased risk of bias rating. They argue that knowledge of one's group assignment is imperative and cannot be separated from the placebo effect in this particular intervention. Thus, we decided to rate the risk of bias in the domains (2) and (4) (i.e., the risk of bias due to unblinding) as not worse than “some concerns”. Risk of bias assessments were carried out independently by LS and PDS, with discrepancies resolved through discussion with JCF and SS.

Data synthesis and analyses

Since knowledge of the received intervention might influence self-reported outcomes, we conducted two separate meta-analyses, one for self-reported outcomes and one for objectively recorded outcomes (i.e., physiological or behavioral variables). The meta-analyses were conducted using the meta package of R, version R 4.0.3. Since all studies reported continuous data, we chose the standardized mean difference (SMD) as the summary outcome. We used Hedges’ g, which corrects for small sample bias⁴⁷. When both pre- and post-intervention values were reported, we first calculated change scores by subtracting pre-intervention from post-intervention scores. We then standardized the difference in change scores between groups using the pooled pre-intervention SD to calculate the corresponding SMDs.

If there were multiple outcomes within one study, we calculated SMDs for all of these outcomes and averaged them^48,49. This approach ensured that there was no bias due to selective choice of outcome depending on effect size and conformity to the hypothesis.

When there were multiple OLP conditions within a study, we proceeded as follows: Our primary goal was to obtain the maximum OLP effect that could be realized experimentally. Since we assumed that suggestive instructions would amplify the placebo effect, we always chose whichever condition was most suggestive. This was operationalized by selecting the condition where most of the instructional statements from Kaptchuk et al.¹³ were utilized. Kaptchuk et al.¹³ were among the first to conduct a clinical trial of OLP and used a rationale (i.e., statements explaining the placebo effect) of four statements with positive framing to optimize placebo response. These statements imply 1) that the placebo effect is powerful, 2) that the body is automatically responding to placebos, 3) that it does not require a positive attitude, and 4) that taking the placebo faithfully is crucial. This or similar rationales were applied by many other researchers¹⁴.

Studies with crossover designs were not included in the meta-analyses as the coefficients required for the computation of the effect sizes were not reported. An alternative approach of analyzing crossover studies is to handle study groups as if they were parallel groups. However, this approach is not recommended by Cochrane as this may lead to a unit-of-analysis error⁵⁰.

Once the effects of the individual studies were calculated, they were aggregated into an overall SMD. We employed a random effects model by applying the inverse-variance weighting method⁴⁷. To correct for differences in the direction of the scale, the means of some studies were multiplied by − 1⁵⁰. Heterogeneity between studies was assessed using the chi-square test and the I² statistic. I² values above 25% are interpreted as low, above 50% as moderate, and above 75% as high heterogeneity⁵¹.

We conducted subgroup analyses to examine the influence of the number of instructions given alongside the OLPs on the efficacy of OLPs. We hypothesize that the statements in the instruction create expectations that, in turn, may elicit placebo effects. We refer to this process as suggestiveness. To assess the degree of the suggestiveness of the instructions in OLPs, we developed a tool based on the four statements applied by Kaptchuk et al.¹³. These statements are given along with the administration of the open-label placebos. However, the placebos in most experimental studies included in our review were administered only once and under the supervision of an experimenter. Therefore, we omitted the fourth statement and formed four subgroups depending on the number of statements utilized in the instructions (ranging from 0 = “no statement utilized” to 3 = “all statements utilized”), with higher values indicating greater suggestiveness. We believe this approach to be reasonable, as many studies investigating OLPs have adopted the instructions from Kaptchuk et al.¹³ and varied the number of statements implemented in the instructions. For the subgroup analyses, we first calculated the pooled effect for each subgroup and then used a Q test to examine whether effect sizes differed between subgroups⁵².

We also conducted exploratory subgroup analyses to examine (1) whether the efficacy of OLPs differed depending on the control condition used (i.e., NT or CP) and (2) whether the efficacy of OLPs differed in lab studies with healthy participants compared to clinically oriented studies involving individuals with subclinical complaints (e.g., test anxiety). We used the same statistical procedures as before. However, these exploratory analyses were specified a posteriori and therefore not reported in the preregistration.

All tests were two-tailed.

Reporting bias assessment

We assessed publication bias by visually inspecting funnel plots for asymmetry. In funnel plots, the SMDs of the individual studies are plotted against their standard error. In addition, we carried out a statistical assessment of funnel plot asymmetry using Egger's regression test, which regresses the SMDs against their standard error⁵³. We did not assess the risk for time-lag bias, as research on OLPs is in its early stages and the interest in non-clinical, healthy populations has arisen only recently.

Certainty assessment

We used the Grading of Recommendations Assessment, Development and Evaluation (GRADE)³⁷ approach to assess the overall quality of the evidence. At the beginning of the assessment process, the overall quality of an RCT is rated as high and can subsequently be down- or upgraded based on eight dimensions: (1) risk of bias, (2) inconsistency, (3) imprecision, (4) indirectness, (5) publication bias, (6) dose response, (7) large effects, and (8) confounding. Based on the ratings of each dimension, the overall quality of evidence is rated as “high”, “moderate”, “low”, or “very low”. GRADE is performed for specific outcomes. However, due to the large number of different outcomes, we decided to form five clusters, in which similar outcomes were grouped together: self-reported pain, objective pain, self-reported positive well-being, self-reported distress, and physiological outcomes. For physiological outcomes, we formed three sub-clusters, each containing a single study, to account for the heterogeneity in physiological outcomes. In our approach, a study may be represented in several clusters due to different outcome variables, but in each cluster only once. Assessments were conducted by two independent raters (LS and PDS), with discrepancies resolved through discussion.

Data availability

Data extracted from the included studies are available in a standardized Excel file, which can be found in the Supplementary Material.

References

Finniss, D. G., Kaptchuk, T. J., Miller, F. & Benedetti, F. Biological, clinical, and ethical advances of placebo effects. Lancet 375, 686–695 (2010).
Article PubMed PubMed Central Google Scholar
Louhiala, P. What do we really know about the deliberate use of placebos in clinical practice?. J. Med. Ethics 38, 403–405 (2012).
Article PubMed Google Scholar
Kaptchuk, T. J. & Miller, F. G. Placebo effects in medicine. N. Engl. J Med 373, 8–9 (2015).
Article CAS PubMed Google Scholar
Chaplin, S. The placebo response: An important part of treatment. Prescriber 17, 16–22 (2006).
Article Google Scholar
Beecher, H. K. The powerful placebo. JAMA 159, 1602 (1955).
Article CAS Google Scholar
Kaptchuk, T. J. Powerful placebo: The dark side of the randomised controlled trial. Lancet 351, 1722–1725 (1998).
Article CAS PubMed Google Scholar
Schmidt, S. Context matters! what is really tested in an RCT? BMJ EBM bmjebm-2022–111966 (2022) https://doi.org/10.1136/bmjebm-2022-111966.
Specker Sullivan, L. More than consent for ethical open-label placebo research. J. Med. Ethics 47, e7–e7 (2021).
Article Google Scholar
Stafford, N. German doctors are told to have an open attitude to placebos. BMJ 342, d1535–d1535 (2011).
Article PubMed Google Scholar
Annoni, M. The ethics of placebo effects in clinical practice and research. Int. Rev. Neurobiol. 139, 463–484 (2018).
Article PubMed Google Scholar
Miller, F. G. & Colloca, L. The legitimacy of placebo treatments in clinical practice: Evidence and ethics. Am. J. Bioethics 9, 39–47 (2009).
Article Google Scholar
Park, L. C. & Covi, L. Nonblind placebo trial: An exploration of neurotic patients’ responses to placebo when its inert content is disclosed. Arch. Gen. Psychiatry 12, 336 (1965).
Article Google Scholar
Kaptchuk, T. J. et al. Placebos without deception: A randomized controlled trial in irritable bowel syndrome. PLoS ONE 5, e15591 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
von Wernsdorff, M., Loef, M., Tuschen-Caffier, B. & Schmidt, S. Effects of open-label placebos in clinical trials: A systematic review and meta-analysis. Sci. Rep. 11, 3855 (2021).
Article ADS Google Scholar
Kaptchuk, T. J. & Miller, F. G. (2018) Open label placebo: can honestly prescribed placebos evoke meaningful therapeutic benefits? BMJ k3889 https://doi.org/10.1136/bmj.k3889.
Locher, C. et al. Is the rationale more important than deception? A randomized controlled trial of open-label placebo analgesia. Pain 158, 2320–2328 (2017).
Article PubMed Google Scholar
Schaefer, M., Sahin, T. & Berstecher, B. Why do open-label placebos work? A randomized controlled trial of an open-label placebo induction with and without extended information about the placebo effect in allergic rhinitis. PLoS ONE 13, e0192758 (2018).
Article PubMed PubMed Central Google Scholar
Guevarra, D. A., Moser, J. S., Wager, T. D. & Kross, E. Placebos without deception reduce self-report and neural measures of emotional distress. Nat. Commun. 11, 3785 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Rathschlag, M. & Klatt, S. Open-label placebo interventions with drinking water and their influence on perceived physical and mental well-being. Front. Psychol. 12, 658275 (2021).
Article PubMed PubMed Central Google Scholar
Swafford, A. P. et al. No acute effects of placebo or open-label placebo treatments on strength, voluntary activation, and neuromuscular fatigue. Eur. J. Appl. Physiol. 119, 2327–2338 (2019).
Article CAS PubMed Google Scholar
Saunders, B. et al. “I put it in my head that the supplement would help me”: Open-placebo improves exercise performance in female cyclists. PLoS ONE 14, e0222982 (2019).
Article CAS PubMed PubMed Central Google Scholar
Schneider, T., Luethi, J., Mauermann, E., Bandschapp, O. & Ruppen, W. Pain response to open label placebo in induced acute pain in healthy adult males. Anesthesiology 132, 571–580 (2020).
Article PubMed Google Scholar
El Brihi, J., Horne, R. & Faasse, K. Prescribing placebos: An experimental examination of the role of dose, expectancies, and adherence in open-label placebo effects. Ann. Behav. Med. 53, 16–28 (2019).
Article PubMed Google Scholar
Glombiewski, A. J., Julia, R., Julia, W., Lea, R. & Winfried, R. Placebo mechanisms in depression: An experimental investigation of the impact of expectations on sadness in female participants. J. Affect. Disord. 256, 658–667 (2019).
Article Google Scholar
Kube, T. et al. Deceptive and nondeceptive placebos to reduce pain: an experimental study in healthy individuals. Clin. J. Pain 36, 68–79 (2020).
Article PubMed Google Scholar
Mathur, A., Jarrett, P., Broadbent, E. & Petrie, K. J. Open-label placebos for wound healing: A randomized controlled trial. Ann. Behav. Med. 52, 902–908 (2018).
Article PubMed Google Scholar
Meeuwis, S. et al. Placebo effects of open-label verbal suggestions on itch. Acta Derm. Venerol. 98, 268–274 (2018).
Article CAS PubMed Google Scholar
Mundt, J. M., Roditi, D. & Robinson, M. E. A comparison of deceptive and non-deceptive placebo analgesia: efficacy and ethical consequences. Ann. Behav. Med. 51, 307–315 (2017).
Article PubMed Google Scholar
Rief, W. & Glombiewski, J. A. The hidden effects of blinded, placebo-controlled randomized trials: An experimental investigation. Pain 153, 2473–2477 (2012).
Article PubMed Google Scholar
Schaefer, M. et al. Open-label placebos reduce test anxiety and improve self-management skills: A randomized-controlled trial. Sci. Rep. 9, 13317 (2019).
Article ADS PubMed PubMed Central Google Scholar
Schaefer, M., Hellmann-Regen, J. & Enge, S. Effects of open-label placebos on state anxiety and glucocorticoid stress responses. Brain Sci. 11, 508 (2021).
Article CAS PubMed PubMed Central Google Scholar
Schneider, R. et al. Effects of expectation and caffeine on arousal, well-being, and reaction time. Int. J. Behav. Med. 13, 330–339 (2006).
Article PubMed Google Scholar
Urroz, P., Colagiuri, B., Smith, C. A., Yeung, A. & Cheema, B. S. Effect of acupuncture and instruction on physiological recovery from maximal exercise: A balanced-placebo controlled trial. BMC Complement Altern. Med. 16, 227 (2016).
Article PubMed PubMed Central Google Scholar
Walach, H., Schmidt, S., Bihr, Y.-M. & Wiesch, S. The effects of a caffeine placebo and experimenter expectation on blood pressure, heart rate, well-being, and cognitive performance. Eur. Psychol. 6, 15–25 (2001).
Article Google Scholar
Deeks, J. J., Higgins, J. P., Altman, D. G. & Group, C. S. M. Analysing data and undertaking meta-analyses. In Cochrane Handbook for Systematic Reviews of Interventions. On behalf of the cochrane statistical methods group. pp. 241–284 (2019).
Sterne, J. A. C., Egger, M. & Moher, D. Addressing reporting biases. In: Higgins JPT, Green S (editors). Cochrane Handbook for Systematic Reviews of Interventions. Version 5.1. 0 (updated March 2011). The Cochrane Collaboration, 2011. Available from handbook. cochrane.org (2011).
Guyatt, G. H. et al. GRADE: An emerging consensus on rating quality of evidence and strength of recommendations. BMJ 336, 924–926 (2008).
Article PubMed PubMed Central Google Scholar
Benedetti, F. Placebo-induced improvements: How therapeutic rituals affect the patient’s brain. J. Acupunct. Meridian Stud. 5, 97–103 (2012).
Article PubMed Google Scholar
Bernstein, M. H. et al. Primary care providers’ use of and attitudes towards placebos: An exploratory focus group study with US physicians. Br. J. Health Psychol. 25, 596–614 (2020).
Article PubMed PubMed Central Google Scholar
Wolter, T. & Kleinmann, B. Placebo acceptability in chronic pain patients: More dependent on application mode and resulting condition than on individual factors. PLoS ONE 13, e0206968 (2018).
Article PubMed PubMed Central Google Scholar
Haas, J. W., Rief, W. & Doering, B. K. Open-label placebo treatment: outcome expectations and general acceptance in the lay population. Int. J. Behav. Med. 28, 444–454 (2021).
Article PubMed Google Scholar
Evers, A. W. M. et al. What should clinicians tell patients about placebo and nocebo effects? Practical considerations based on expert consensus. Psychother. Psychosom. 90, 49–56 (2021).
Article PubMed Google Scholar
Page, M. J. et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. Syst. Rev. 10, 89 (2021).
Article PubMed PubMed Central Google Scholar
Charlesworth, J. E. G. et al. Effects of placebos without deception compared with no treatment: A systematic review and meta-analysis. J. Evid. Based Med. 10, 97–107 (2017).
Article PubMed Google Scholar
Higgins, J. P. T. & Deeks, J. J. Selecting studies and collecting data. In: Higgins JPT, Green S (editors), Cochrane Handbook for Systematic Reviews of Interventions Version 5.1. 0 (updated March 2011). The Cochrane Collaboration, 2011. Available from handbook. cochrane.org (2011).
Sterne, J. A. C. et al. RoB 2: A revised tool for assessing risk of bias in randomised trials. BMJ 366, l4898 (2019).
Article PubMed Google Scholar
Döring, N. & Bortz, J. Forschungsmethoden und Evaluation in den Sozial- und Humanwissenschaften (Springer, Berlin, 2016). https://doi.org/10.1007/978-3-642-41089-5.
Book Google Scholar
López-López, J. A., Page, M. J., Lipsey, M. W. & Higgins, J. P. T. Dealing with effect size multiplicity in systematic reviews and meta-analyses. Res. Syn. Meth. 9, 336–351 (2018).
Article Google Scholar
Marín-Martínez, F. & Sánchez-Meca, J. Averaging dependent effect sizes in meta-analysis: A cautionary note about procedures. Span. J. Psychol. 2, 32–38 (1999).
Article PubMed Google Scholar
Higgins, J. P. et al. Cochrane Handbook for Systematic Reviews of Interventions (John Wiley & Sons, 2019).
Book Google Scholar
Higgins, J. P. T. Measuring inconsistency in meta-analyses. BMJ 327, 557–560 (2003).
Article PubMed PubMed Central Google Scholar
Harrer, M., Cuijpers, P., Furukawa, T. A. & Ebert, D. D. Doing Meta-analysis with R: A Hands-on Guide (Chapman and Hall/CRC, Boca Raton, 2021). https://doi.org/10.1201/9781003107347.
Book Google Scholar
Egger, M., Smith, G. D., Schneider, M. & Minder, C. Bias in meta-analysis detected by a simple, graphical test. BMJ 315, 629–634 (1997).
Article CAS PubMed PubMed Central Google Scholar

Download references

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Department of Psychology, Occupational Psychology, University of Freiburg, Freiburg, Germany
Lukas Spille, Patrik D. Seuling & Anja S. Göritz
Department of Psychosomatic Medicine and Psychotherapy, Medical Center - University of Freiburg, Faculty of Medicine, University of Freiburg, Freiburg, Germany
Johannes C. Fendel & Stefan Schmidt

Authors

Lukas Spille
View author publications
You can also search for this author in PubMed Google Scholar
Johannes C. Fendel
View author publications
You can also search for this author in PubMed Google Scholar
Patrik D. Seuling
View author publications
You can also search for this author in PubMed Google Scholar
Anja S. Göritz
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Schmidt
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

S.S. developed the study concept. L.S. was responsible for study execution and writing of the first draft of the manuscript, with assistance by S.S. and J.C.F. L.S. and P.D.S. conducted the title, abstract, and full-text screening, extracted data, and rated the risk of bias as well as the overall quality of evidence (GRADE). J.C.F. performed the statistical analysis. A.S.G. assisted in the study execution. All authors critically revised the manuscript.

Corresponding author

Correspondence to Stefan Schmidt.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Spille, L., Fendel, J.C., Seuling, P.D. et al. Open-label placebos—a systematic review and meta-analysis of experimental studies with non-clinical samples. Sci Rep 13, 3640 (2023). https://doi.org/10.1038/s41598-023-30362-z

Download citation

Received: 22 September 2022
Accepted: 21 February 2023
Published: 04 March 2023
DOI: https://doi.org/10.1038/s41598-023-30362-z
Springer Nature Limited

This article is cited by

Open-label placebos enhance test performance and reduce anxiety in learner drivers: a randomized controlled trial
- Michael Schaefer
- Sören Enge
Scientific Reports (2024)
A randomized controlled trial of effects of open-label placebo compared to double-blind placebo and treatment-as-usual on symptoms of allergic rhinitis
- Michael Schaefer
- Kurt Zimmermann
- Paul Enck
Scientific Reports (2023)
Open-label placebo treatment does not enhance cognitive abilities in healthy volunteers
- Helena Hartmann
- Katarina Forkmann
- Ulrike Bingel
Scientific Reports (2023)

Open-label placebos—a systematic review and meta-analysis of experimental studies with non-clinical samples

Abstract

Similar content being viewed by others

Introduction

Results

Study selection

Characteristics of studies, participants, and interventions

Risk of bias within studies

Data synthesis and analyses

Subgroup analyses

Reporting bias

Certainty of evidence

Discussion

Methods

Eligibility criteria

Information sources and search strategy

Study selection and data extraction

Study risk of bias assessment

Data synthesis and analyses

Reporting bias assessment

Certainty assessment

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation