FormalPara Take-home message

Coaching doctors regarding ethical decision making in teams may improve goal-oriented care operationalized via written do-not-intubate and do-not attempt cardiopulmonary resuscitation orders.

Introduction

Fast medical progress and technological innovation pose a significant challenge to doctors, who are asked to find the delicate balance between life-prolonging treatment and palliative care [1,2,3]. Although the medical community continues to put tremendous efforts in trying to enhance prognostication via objective and thus universal factors or scoring systems [4], literature highlights a large variability in written do-not-intubate and do-not attempt cardiopulmonary resuscitation (DNI-DNACPR) decisions, use of health care resources at end-of-life, referral to the intensive care unit (ICU), palliative care and place of death across continents, countries, hospitals, wards, doctors and patients [5,6,7,8,9,10,11,12,13,14]. This indicates that organizational [10, 15, 16] but also subjective factors at the continent and country (“culture”), team (“climate”) and personal (“style”) level exert a greater influence than objective factors in medical ethical decision-making [10, 15,16,17,18,19,20]. However, subjective factors are rarely acknowledged by clinicians, more specifically by doctors at the bedside [20,21,22,23,24].

Becoming aware of one’s personal underlying emotions (self-reflection) that are inherent to complex medical ethical decisions such as anxiety, powerlessness and guilt, and learning to better cope with these emotions (self-regulation) are thus together with motivation, empathy and social skills (emotional intelligence) [25, 26] essential to guarantee patient-centered care [20,21,22,23,24]. Furthermore, timely sharing of emotions, conflicting ideas, values, knowledge and experience between professionals with different backgrounds within a safe ethical climate may help in reducing prognostic uncertainty in doctors and in stimulating ethical awareness in the team [8, 20, 27,28,29,30,31,32]. For instance, whereas doctors focus more on the proportionality between treatment intensity and estimated prognosis than nurses, nurses acknowledge dignified care [32] and the need for open communication towards patients and relatives sooner than doctors during hospital stay [29]. This collective awareness based on complementary visions enriches the ethical decision-making process for the benefit of the patient and may increase doctors’ self-confidence in effectively communicating and making decisions with regard to immediate or future potentially excessive treatments together with the patient and his or her relatives [8, 20, 27]. Besides mitigating the risk of prolonged suffering and complicated grief among patients and relatives respectively [2, 3, 33, 34], timely and appropriate ethical decision-making within teams may also reduce burnout and intent to leave among clinicians [3, 20, 35,36,37] and the cost for society [3, 38, 39]. However, creating a safe ethical climate which enables nurses and other members of the team to speak up about patient safety concerns [40], more specifically with regard to potentially excessive treatments, requires specific leadership skills [20, 25,26,27, 41,42,43] that are not or insufficiently taught in the medical curriculum [43, 44].

The aim of this study was to assess whether coaching doctors during 4 months in self-reflective and empowering leadership, and in managing team dynamics with regard to ethical decision-making increases the incidence of goal-oriented care operationalized by written DNI-DNACPR decisions in adult patients potentially receiving excessive treatment (PET) during their first hospital stay and the perception of the quality of the ethical climate by clinicians.

Methods

We refer to a previous publication for the detailed study and coaching protocol [20].

Participants

All medical and surgical departments within Ghent University Hospital frequently referring adult (> 18 year old) patients to the ICU were invited to participate in the current study during meetings organized in 2018–2019. Surgical departments did not express the need to improve ethical decision-making in our hospital. All eight internal medicine departments (Cardiology, Gastro-enterology and Hepatology, General Internal Medicine, Geriatrics, Hematology, Medical Oncology, Nephrology, Pulmonology) frequently referring adult patients to the ICU and the Neurology department acknowledged room to improve their ethical decision-making [20] and were thus included in this study together with the medical ICU.

All junior and senior doctors taking care of hospitalized patients in the ten participating departments were eligible for the intervention. Junior doctors were defined as doctors in training. All nurses and doctors were invited to fill out the ethical decision-making climate questionnaire (EDMCQ) [27] before and after the 12 months study period (February 2022 until February 2023) and to identify PET during the study period. In line with a previous study performed in 68 ICUs, PET were defined as patients who were perceived by two or more doctors or nurses as receiving excessive treatment [8]. These patients had a 7% probability of surviving at home with a good quality of life at 1 year [8]. Excessive treatment was defined as treatment that is perceived to be no longer consistent with the expected survival or quality of life (“too much treatment”) or that is perceived as being provided against the patient’s or relatives’ wishes [3, 8]. This definition entails thus potentially futile, non-beneficial, inappropriate or even harmful treatments [3, 45], which has been estimated to be provided to 33–38% of patients near the end-of-life [9], as well as treatments that are potentially being provided without voluntary and informed consent [3]. Only adult patients hospitalized for the first time in the hospital were included in this study. Study data were collected and managed using REDCap electronic data capture tools hosted at the Ghent University Hospital [46].

We refer to Tables 1 and 2 for the baseline characteristics that were collected at the clinician and patient level, respectively.

Table 1 Clinician characteristics before and after the study period, overall and according to their role
Table 2 Characteristics of patients potentially receiving excessive treatment and overall study population

Study design and randomization

The study followed a stepped wedge cluster randomized trial design, run across the ten different participating departments. All ten departments were randomly assigned to start a 4-month coaching period based on a random number generator in the software R by the statistician. The study was not blinded, however clinicians were asked to not actively inform patients or relatives about the timing of the intervention.

Intervention

The intervention in junior and senior doctors consisted of four components: (1) a single interactive session lasting 2–3 h focusing on the concepts of medical ethical decision-making, the psychological challenge of dealing with ethically sensitive medical topics, and empowering leadership; (2) observation of interdisciplinary team meetings by a first coach who also gave feedback to the doctor in charge to enhance self-reflection on empowering leadership and managing group dynamics during the 4-months intervention period; (3) individual coaching by a second coach, centered on fostering self-reflection and empowering leadership, along with addressing group dynamics concerning ethical decision-making about PET during the four months intervention period. In absence of such cases, the focus shifted to encompass all critical aspects of ethical decision-making pertinent to doctors. Each doctor was invited to participate to at least eight coaching sessions of 1 h during the intervention period, to be extended upon request. Coaching was provided by senior clinical psychologists specialized in systemic psychodynamic coaching. Both coaches received supervision provided by an internationally recognized coaching expert and faculty within this coaching model; (4) throughout the intervention coaches and doctors in charge were informed of the presence of a PET within their ward by an electronic alert. This alert was accessible via the electronic patient data record. Nurses and doctors were instructed to use this CODE alert for each patient under their care in whom they deemed that treatment was excessive. The CODE alert and the study are briefly explained in the video available at https://youtu.be/68TDLdW1vFQ. This video was also used in the multifaceted communication campaign that was held throughout the entire study period to enhance participation and inclusion of PET (supplementary File 1). Progress in self-reflection was surveyed by the second coach.

Primary endpoints and hypotheses

The first primary endpoint in this study was written DNI-DNACPR decision. The second primary endpoint was the EDMCQ. The EDMCQ is a 32-item validated questionnaire [8, 20, 27] that consists of seven domains: factor F1 “self-reflective and empowering leadership of doctors”, F2 “open and interdisciplinary reflection”, F3 “not avoiding end-of-life decisions”, F4 “mutual respect within the interdisciplinary team”, F5 “active involvement of nurses in end-of-life care and decision-making”, F6 “active decision-making by doctors” and F7 “ethical awareness”. For this study, we focused on the 30 EDMCQ items that are also applicable outside the ICU setting. We decided to use two endpoints because ethical sensitive decisions and the climate in which these decisions are taken are intrinsically connected with each other. We hypothesized that the intervention would affect F1, which in turn would affect all other EDMCQ factors and via these, the incidence of written DNI-DNACPR decisions [20].

These primary objectives were formalized in the following hypotheses: (1) the intervention changes the incidence in written DNI-DNACPR decisions in PET from 35% to 50% over the 12-month study period; (2) the intervention increases the mean EDMCQ factor sum-score in clinicians (doctors and nurses) by at least 2.8 points (equals the sum of the differences in the 7 factors between units with “good” and with an “average ethical climate with involvement of nurses at end-of-life” in the DISPROPRICUS study) over the 12-month study period [8, 20].

Power analysis

A Monte Carlo simulation based on a pilot study conducted in 2019 evaluated that 605 PET would be required to detect an increase in written DNI-DNACPR decisions from 35% before to 50% post intervention with 86% power at the 5% significance level [20]. For the change in EDMCQ score after versus before the intervention, a Monte Carlo power evaluation showed that 5 clinicians per department were required to detect an increase with 2.8 points with 93% power at the 5% significance level.

Statistical analysis

The written DNI-DNACPR decision analysis was conducted by logistic mixed effect models with random intercept to account for between-department variability, assuming a constant risk before and after intervention, and a linearly changing risk (0 to 1) during the intervention [20], considering at least 1 month of coaching for an effect. This adjustment captures a potentially time-dependent treatment effect, not explicitly accounted for in the initial protocol, while maintaining balance between the intervention and the control arm. The analysis included patients without a written DNI-DNACPR decision at the time of the first registration, in order to investigate the coaching effect on written DNI-DNACPR incidence during the study period. The change in EDMCQ score was analyzed based on linear mixed effects models including an intervention indicator and a random intercept to account for between-department variability. Survival analysis at 1-year was conducted using stratified cox proportional hazards model by department, while allowing potential variations across departments. Naive models are department-specific adjusted models only including intervention effect as predictor. To evaluate the robustness of our findings, these naive models were additionally adjusted for differences in baseline patients characteristics. Two-sided P-values were calculated.

Because of the small number of PET and subsequent power issues, we decided to perform a post hoc analysis on the overall population of first admissions and to restrict this analysis to the primary endpoints and one of our predefined secondary safety endpoints (mortality up to 1 year) to reduce the risk of type I errors. After the study period, clinicians were surveyed about the reasons for the low identification of PET.

Ethics

This study was conducted according to the Declaration of Helsinki and has been reviewed and approved by the Ethics Committee of the Ghent University Hospital (BC-09828, date of approval May 27th 2021). This committee includes patient representatives.

An electronic and written informed consent was required for clinicians to participate to this study and to the individual coaching sessions, respectively. Informed consent was waived for post-hoc anonymous data on written DNI-DNACPR decisions and mortality provided by the independent Data Science Institute. We asked informed consent to PET to collect survey data on satisfaction and well-being, however, these data were not used because of underpowering.

In line with the models for shared decision making [1, 47], attending doctors in Belgium must obtain voluntary and informed consent for treatment limitation decisions such as DNI-DNACPR from the patient or one of his/her representatives in case of incapacity (“Patients Rights Act” 2002, Article 96).

Results

Of the 93 and 94 doctors, and 349 and 329 nurses working before February 2022 and after February 2023 in the ten departments, respectively 63 (67.7%), 63 (67%), 207 (59.3%) and 198 (60.1%) filled out the EDMCQ, resulting in an overall response rate of 61% and 61.7% (Fig. 1). Thirty-eight doctors and 127 nurses filled out the EDMCQ at both moments, before and after the study period. The characteristics of the departments and clinicians are listed in the supplementary File 2 and Table 1, respectively.

Fig. 1
figure 1

Flowchart. PET patients potentially receiving excessive treatment, DNI-DNACPR do-not-intubate and do-not-attempt-cardiopulmonary resuscitation

Of the 7254 patients hospitalized for the first time during the study period, 125 (1.7%) were identified as PET. In 16 PET outcome data were missing. Twenty-six of the 109 (23.9%) PET and 624 of the 7238 (8.6%) overall study population already had a written DNI-DNACPR decision at study entry, resulting in 83 and 6614 patients who were included in the main and post-hoc analysis, respectively. Randomization at the department level resulted in 37 vs. 46 patients respectively in the intervention vs. control arm for the main analysis and 4012 vs. 2602 for the post hoc analysis. The baseline characteristics of the PET and the overall study population in both arms are listed in Table 2.

The reasons according to the clinicians for having identified less PET than in our pilot study are listed in supplementary File 2. The top five reasons were: fading attention for the study over time (respectively, 53.7% in doctors and 74.7% in nurses), poor visibility of the CODE alert to identify patients in the electronic patient file (34.3% and 50.7%), fear of blaming doctors (28.4% and 44.2%), disbelief that the identification will change something at the patient level (14.9% and 42.9%), and because interdisciplinary meetings about end-of-life issues increased during the study (35.8% and 35%). Respectively, 71.6% of the doctors and 94.5% of the nurses wished to keep on using the CODE alert in the future, although 9% and 23% were concerned about their anonymity.

Reach and adherence to the coaching and observed progress by the coach

Overall, 78 observations (7–8 per department) of interdisciplinary team meetings occurred during the 4-month intervention period, where feedback was provided to the doctors about their empowering leadership style and the group dynamic. Of the 51 senior and 42 junior doctors, respectively 34 (66.7%) and 22 (52.3%) participated in the single interactive session of 2–3 h focusing on the concepts of medical ethical decision-making and 32 (62.7%) and 18 (42.8%) participated in the individual coaching for a mean (standard deviation, SD) of 4.36 (2.55) sessions. Of the 248 individual coaching sessions, 40 (16.1%) concerned PET who were identified by the team during the intervention period, 102 (41.1%) concerned hospitalized patients who were mentioned by the doctor as potentially receiving excessive treatment and who were not identified by the team during the intervention period, and 116 (46.7%) concerned patients who potentially received excessive treatment according to the doctor prior to the intervention or coaching how to cope with ethical dilemmas in general. The observed progress in the five self-reflection items regarding ethical decision-making in teams according to the coach before and after the intervention is depicted in the supplementary File 2.

Main and post hoc analysis of the first primary endpoint and safety endpoint

The estimated incidence of written DNI-DNACPR decisions in PET was 29.7% in the intervention vs. 19.6% in the control arm (Table 3). The odds ratios (95% confidence interval) of the intervention in the naive and adjusted analysis were, respectively, 4.24 (4.21–4.27) and 3.71 (0.54–25.5). The estimated one-year mortality was 85% and 83.7%, respectively. The hazard ratios of the intervention regarding 1-year mortality were, respectively, 2.76 (1.26–6.04) and 3.26 (1.35–7.84).

Table 3 Results of the primary endpoints and the second safety (1 year mortality) endpointa

The estimated incidence of written DNI-DNACPR decisions in the overall study population was 3.4% in the intervention vs. 1.9% in the control arm. There was a statistically significant increase in nearly all written treatment-limitation-decisions other than DNI-DNACPR after the intervention (supplementary File 2), including the decision of not referring the patient to the medical ICU (2.17% vs 1.19%, P = 0.004). The odds ratios (95% confidence interval) for written DNI-DNACPR orders in the naive and adjusted analysis were, respectively, 1.65 (1.12–2.43) and 2 (1.34–3). The estimated 1-year mortality was 14.5% and 15.1%, respectively. The hazard ratios of the intervention regarding 1-year mortality were, respectively, 0.89 (0.72–1.09) and 1.03 (0.84–1.27).

The results of the adjusted models, in which we observed a linear coaching exposure effect relationship of the intervention, and the Kaplan–Meier survival curves are depicted in detail in supplementary File 2.

Main analysis and sub-analysis of the second primary endpoint

The mean (standard deviation) EDMCQ before and after the study period was 0.45 (0.16) vs. 0.47 (0.16) points (Table 2). The absolute effect estimate of the intervention in the naive and adjusted analysis was 0.02 (0.11) and 0.02 points (0.11), respectively (P = 0.81 and P = 0.83).

The sub-analysis of the differences across the 7 EDMCQ factors before and after the study period are depicted in Fig. 2. We observed only a highly statistically significant increase in the factor “ethical awareness” (P < 0.001) and a statistically significant decrease in the factor “mutual respect within the interdisciplinary team” (P = 0.032).

Fig. 2
figure 2

EDMCQ pre-post for all clinicians. EDMCQ ethical-decision-making climate questionnaire. Two-sided t-tests on combined paired and unpaired data were performed. The stars are only intended to flag levels of significance: *P < 0.05; **P < 0.01; ***P < 0.001.

Safety and adverse events

We received no complaints from patients or clinicians via the service, heads or head nurses from the departments.

Discussion

According to four recent meta-analyses [48,49,50,51], this is the first large multi-department cluster randomized trial assessing the impact of coaching doctors in self-reflective and empowering leadership, as well as in managing groups dynamics on concrete patient and clinician outcomes, more specifically regarding ethical decision-making in teams. In this study, we observed that (1) although ethical decision-making is a burning issue in many hospitals, clinicians identified a much smaller number of PET during the current interventional study than during our observational pilot study in 2019, (2) the intervention highly likely increased the incidence of written DNI-DNACPR decisions in the overall study population without increasing the 1-year mortality and (3) the intervention was not associated with a significant improvement in the quality of the ethical climate as perceived by clinicians.

Despite advance care planning and goal-oriented care being officially considered as a strategic priority in our hospital, and an intensive communication campaign in the weeks preceding and during the study period, clinicians identified ten times less PET than observed in our pilot study [20], resulting in a fivefold lower inclusion rate required to detect an intervention effect according to our power analysis. In contrast to our pilot study, we focused only on patients admitted for the first time in the hospital during the current study, which represents 63.6% of the entire patient population that is in average admitted in the participating wards yearly (supplementary file 2). With a 10% estimated incidence of PET, we still expected to include enough PET to detect a difference with 86% power at the 5% significance level [20]. However, we also used the CODE alert to identify PET instead of a dedicated researcher who actively surveyed clinicians. Identifying PET in the current study had also consequences for doctors and potentially for patients. Although fading attention for the study over time and visibility of the electronic CODE alert to identify PET was claimed as the main reasons by 75% and 50.7% of the nurses, respectively, in comparison to 53.7% and 34.3% of the doctors, 95% and 71.6% expressed the desire to keep on using this alert in the future. This underscores a deeper concern, particularly in nurses. More than 40% expressed fear of blaming doctors or skepticism regarding the impact of identifying PET. Nonetheless, 35% acknowledged improvement in interdisciplinary meetings about end-of-life issues since study initiation. These findings together with the incidence of written DNI-DNACPR decisions in the overall study population that remained low after the intervention (3.4%) in comparison to the 1-year mortality (14.5%) highlights the need to additionally coach the entire team in future studies to ensure safety and enhance speaking-up for the benefit of patients, more specifically among nurses.

Due to deviation from the predefined hypothesis, we cannot entirely claim a causal effect between our intervention and the first primary endpoint. However, the statistically significant increase in written DNI-DNACPR decisions from 1.9% in the control to 3.4% in the intervention arm within the overall study population, both in our naive (P = 0.011) and adjusted post hoc analysis (P < 0.001), along with the linear coaching exposure effect pattern, strongly suggest that our intervention bolstered doctors’ self-confidence in implementing goal-oriented care operationalized via written DNI-DNACPR decisions. This is further supported by a concomitant statistically significant increase in nearly all treatment-limitation-decisions other than DNI-DNACPR after the intervention, including the decision of not referring the patient to the medical ICU (supplementary File 2). Moreover, we found no significant increase in 1-year mortality, suggesting with a high probability that our intervention did not make doctors overconfident in taking written DNI-DNACPR decisions. Therefore, such decisions should not be considered as a death sentence but rather as a way to clarify to patients, relatives and the team what can fairly be expected from additional interventions in case of deterioration. This observation aligns with meta-analyses that did not find a relationship between palliative care or surrogate decision-making interventions and patient survival [52, 53]. After adjustment for confounders we found a statistically significant increase in 1-year mortality in PET, however, without concomitant increase in written DNI-DNACPR decisions. Whether this finding should be considered as a true indirect effect of the intervention or as a spurious result due to underpowering should be assessed in future studies.

Our intervention was not associated with an overall increase in EDMCQ. We observed only an increase in the factor “ethical awareness” and a decrease in the factor “mutual respect within the interdisciplinary team” (Fig. 2). Although the intervention increased self-reflection as reported by the coach (supplementary File 2) and subsequent self-confidence in operationalizing goal-oriented care via written DNI-DNACPR decisions, this finding suggests that the intervention helped doctors in enhancing ethical awareness in the team, however, without really empowering and involving the team during ethical decision-making. Failed expectations about the intervention together with the improvement in ethical awareness may have increased moral distress, more specifically among nurses, with a decrease in mutual respect in the interdisciplinary team as a consequence in some departments. This is another argument for coaching entire teams in future interventions.

Limitations of this study

Our study has several limitations. First, external validation of the results may pose a challenge. However, the positive intervention effect across most of the departments in combination with the high variability in ethical decision-making climate (supplementary File 2) suggest that our findings may be extrapolated to other centers. Second, external validation might be further compromised by the fact that the success of the intervention is largely coach-dependent. To guarantee the safety of the doctors and maximize the pre-test probability of the intervention’s effectiveness, we decided to collaborate with experienced senior clinical psychologists who were trained in systemic psychodynamic coaching and who received supervision by an internationally recognized coaching expert and faculty. Coaches were also asked to follow as much as possible the coaching protocol [20]. Nevertheless, the success of such interventions has been shown to be more dependent on the quality of the human relationship between the coach and the coachee than on the intervention itself [51]. Doctors’ satisfaction with the coach and coaching sessions in our study was good to excellent [unpublished]. Third, by the nature of the stepped-wedge design, with the timing of the intervention per department difficult to conceal, blinding the patients and relatives to the intervention could not be guaranteed. Fourth, because of pre-post design, we cannot claim causality regarding our second primary endpoint. Fifth, written DNI-DNACPR orders are only a proxy of decisions that many patients would make in the context of life-limiting illness [1, 54]. Therefore, albeit written DNI-DNACPR orders can easily and objectively be measured, they do not truly measure or represent the ground truth of either goal-oriented care or shared decision making. Finally, due to deviation from the initial targeted population, we were unfortunately not able to assess the impact of our intervention on satisfaction and wellbeing of patients and their relatives. Although we think it is reasonable to assume that our intervention also stimulated the development of a relationship with patient and relatives during shared decision-making in a similar way as in a team [20, 47], this should be certainly assessed in future interventions.

Conclusion

Our study suggests that coaching doctors regarding ethical decision-making in teams improves goal-oriented care operationalized via written DNI-DNACPR decisions in adult patients during their first hospital stay without increasing their mortality at 1 year. However, the small number of PET identified by the teams and the absence of improvement in ethical climate highlight the need to additionally coach entire teams in future interventions.