Keywords

7.1 Introduction

The COVID-19 pandemic has brought into sharp focus the importance of thinking about what constitutes ethical use of data in a public health emergency. This is because the pandemic has resulted in the generation of an unprecedented and exponentially mounting volume of data, including individual-level health data. The timely and appropriate use of such data (e.g. data from public health surveillance, electronic health records and research projects) has great potential to contribute to successful public health policies, effective therapeutic interventions and enhanced public support for, and trust in, governmental responses to the pandemic (Han et al. 2021; Rios et al. 2020). However, the collection and use of individual-level health data may also be problematic if not managed appropriately. Concerns have been raised about privacy and data protection (Ienca and Vayena 2020). These are particularly relevant in certain situations: first, in relation to the use of surveillance and registry data and the extent to which reporting individual-level health data may violate trust and create fear of stigmatization and discrimination (Bayer and Fairchild 2000); and second, in the context of the collection and curation of data (particularly when there are concerns about data being sensitive) by novel and remote platforms (Newlands et al. 2020). In both these instances, the apparent loss of individual control over the collection and use of personal health data is seen to strike at the heart of values such as autonomy, privacy and trust (Vayena and Blasimme 2017).

In the context of conducting research during the pandemic, issues relating to the use and sharing of individual-level health data have taken on increased urgency. There have been calls for greater use of transparent and open-science data-sharing options, and speedier sharing of data to inform COVID-19 patient management and response (Moorthy et al. 2020; Homolak et al. 2020). International research funders such as the Bill and Melinda Gates Foundation and Wellcome have mandated that their grantees share data from research related to COVID-19 as soon as the data collection is completed, regardless of publication status. International expert and working groups have been established to facilitate effective, equitable and ethical data-sharing (Fegan et al. 2021).

Most countries have laws that regulate the processing of personal information, which include the managing and sharing individual level health data. The European General Data Protection Regulation (GDPR) (2016) is an example of such a document. Researchers should be mindful of the legal duties imposed by such regulations. However, almost all of these legislative frameworks are likely to provide exemptions for the processing of data for special reasons or if they fall within special categories such as reasons of public interest relating to public health (for example, Article 9(2)(i) of the GDPR).

As demonstrated by the cases in this chapter, a number of ethical issues arise from the use of different kinds of data, and the ways in which they are collected, processed and shared in the context of research during a pandemic. Two broad principles are generally associated with managing and sharing health data in research: first, that researchers should ensure research is carried out in a way that is respectful of persons and communities; and second, that the research is carried out in a manner that is fair to stakeholders, i.e. that it promotes equity. Although the issues raised in these cases engage the same broad ethical principles as research in non-emergency settings, two observations can be made. First, the principles play out against a very different background during a pandemic, when the value of data may be perceived quite differently. The literature suggests that the value of collecting, processing and sharing data during a pandemic is most often linked to the utility of the data, which is commonly measured by assessing potential benefits in terms of (1) the reduction of suffering of current and future populations, (2) improvement of quality of life during and after the pandemic and (3) the reduction of the socio-economic impact of the pandemic (Bull et al. 2015b; Pratt and Bull 2021). Whether and in what way the approaches to maximizing the utility of data during a pandemic might be justified, or come into conflict with the traditional ethical principles noted above, will be considered by reference to the case studies in this chapter. Second, these cases highlight the importance of context and the need to recognize the salient considerations that should inform how the two broad ethical principles of specific relevance – respect for persons and communities, and promoting equity – should be taken into account during a pandemic.

7.2 Respect for Persons and Communities

It is important that researchers and other secondary users of data respect the persons and communities they engage with. The principle of respecting persons recognizes that individuals should be treated as autonomous agents, and that persons with diminished autonomy are entitled to protection (National Commission 1979). As autonomous agents, individuals and communities have the right to make their own decisions about how their information is collected and used. Respecting persons and communities means that individuals must provide consent to the use of their data, and that their privacy is assured. There are certain situations where waivers of consent are permitted provided that certain conditions are met (CIOMS 2016). But even in such situations, researchers are required to consider whether the consent process might be modified in order to preserve as much of the individual’s autonomy as possible.

7.2.1 The Consent Process

A properly executed consent process is a vital aspect of respecting individual research participants. As part of such a process, adequate information is provided to participants, and participants understand what is proposed, including the nature of any risks and benefits to them and how such risks are to be managed and minimized. Moreover, this consent should be voluntarily given and researchers should ensure that there is no undue influence or deception involved. Participants should also be aware that they may withdraw from the research at any time without the need to provide any explanation or justification, and without any penalty or prejudice to any treatment they may be receiving (CIOMS 2016).

Researchers engaged in collecting, processing and sharing health data should ensure that effectively designed consent processes are put into place. Participants should be provided with information that is explained in a way that allows them to have an appropriate level of understanding. This information should be sufficient and relevant. Participants should understand what health data will be collected and for what purposes, and how their data will be stored and shared. Researchers must carefully consider the risks to participants, and, where relevant, to their communities. In cases where potential participants may have low literacy levels, be unfamiliar with health research, and suffer from social vulnerability, extra measures may need to be taken to ensure that they are able to provide meaningful consent (Bhutta 2004; Cheah et al. 2018).

Given the complexity and abstract nature of data collection, use and processing, particularly in the context of mobile applications and novel platforms, and given most people’s unfamiliarity with it, providing accessible information about data-sharing can be challenging. Case 7.1 illustrates this with an example of a mobile application that collects both sensitive personal data and other personal data. The data collected and processed by the application are used and shared with different parties in accordance with the legal restrictions in place and on the privacy policy of the developers. The envisaged creation of a consortium and data pool for research purposes will mean that the application will be used both as a source of data for research and as a tool for individuals to assess their symptoms. The many different activities associated with the application, as well as the many different ways in which data may be processed and shared, prompt consideration of researchers’ obligations to design a consent process that ensures participants understand how their data may be used and shared in the context of research and what measures will be taken to protect their privacy.

Significantly, during a public health emergency, research takes on a new urgency, and the extensive collection, processing and sharing of data can be particularly valuable. In relation to the ethical principle of requiring meaningful consent from participants, this raises the challenge of whether and how the prioritization of research during a pandemic should be balanced against traditional consent mechanisms. In such situations, modifications of the consent process may be permitted if there is no other feasible or practicable option, the research has important social value and it poses no more than minimal risks to participants. This will be explored in Cases 7.1 and 7.2 in relation to two aspects of consent: consent for future use of data and approaches that involve waiving consent or opting out of sharing.

7.2.2 Consent for Use of Data in the Future

Case 7.1 envisions a variety of future uses and sharing of data, which then raises the issue of what sort of consent would be appropriate in such a case. There has been much debate about appropriate models of consent to allow sharing, storage and future use of data. There is a spectrum of approaches to consent for the future use of data. It ranges from “specific consent” (where the participant would be re-contacted for permission in connection with any future research study) to “blanket consent” (where any use is permissible, including uses unrelated to health) to no consent at all (Tindana et al. 2019). Between the two extremes, there has been increasing support for “broad consent”, which is consent for unspecified future use as long as the future use is within the scope of the broad consent, for example “health research” or “malaria research”, and with appropriate governance processes in place (CIOMS 2016). However, some authors have challenged the concept of “broad consent”, asking whether it can constitute informed consent (Sheehan 2011). If the project in Case 7.1 has not set a time limit on how long it will hold participants’ information and what sort of research might be carried out (given the value of the data in the time of COVID-19) the researchers may be requesting blanket consent.

In considering whether “blanket consent” for future use of health data in relation to Case 7.1 might be justified during a pandemic, it may be worth taking note of certain factors. The first is the utility of digital symptom trackers as a public health tool. The literature suggests that symptom trackers are very valuable tools for monitoring a public health threat and enabling a quick response. They also provide governments with information to assist them in allocating resources and generally minimizing or controlling outbreaks (Gasser et al. 2020). From a public health perspective, the use of blanket consent might be the most efficient way of maximizing the utility of such a tool, and careful consideration is required of how the interests of participants should be balanced with threats to public health and safety in a pandemic.

7.2.3 Waiving Consent and the Opt-Out Approach

Case 7.2 considers whether waivers of consent and opt-out approaches are appropriate in the context of a pregnancy outcomes registry. Waivers of consent must typically be approved by research ethics committees (CIOMS 2016) and are approved only when specific conditions are met, such as “the waiver or alteration will not affect the rights of the subjects, the research cannot be carried out without the waiver, and, when appropriate, subjects will be provided information after participation” (Berg et al. 2018). However, the commentary to Guideline 10 of the Council of International Organizations of Medical Sciences (CIOMS) guidelines provides special considerations for waiving informed consent in studies using data from health registries, citing the importance of having comprehensive and accurate information about an entire population, the avoidance of undetectable selection bias and the need to equitably distribute benefits and burdens across a population (CIOMS 2016).

Significantly, in relation to public health emergencies, the guidance seeks to balance the social value of such research against the risks of violating individual autonomy. First it recognises in the commentary that when such studies are conducted under public health mandates or by public health authorities, no ethical review of waiver of consent is needed as the research is mandated by law. However, consent cannot be waived by a public health authority if the research combines data in a registry with new activities that involve direct contact with participants. Moreover, even in such situations, it stipulates that when the use of such data no longer constitutes a public health activity, researchers must seek individual consent or obtain ethics review approval to waive consent under the conditions in Guideline 10 (CIOMS 2016). It is important to recognise that the ethical justification for a public health mandate is a separate question entirely. Whether or not any particular public health mandate is ethical will depend on a separate ethical analysis based on principles related to public health ethics such as the harm principle, least coercive or restrictive means, reciprocity principle and transparency principles (Upshur 2002). Second, in the absence of a legal mandate, when considering whether to waive individual consent, researchers and research ethics committees are required to consider whether there are any other modifications that can be made to the informed consent process that would allow for the greatest expression of individual autonomy (CIOMS 2016).

In this case, researchers chose an opt-out approach, which on the face of it appears appropriate and respectful of the rights of patients. According to CIOMS (2016), an opt-out procedure must fulfil the following conditions: (1) participants need to be aware of its existence; (2) sufficient information needs to be provided; (3) participants need to be told that they can withdraw their data; and (4) a genuine opportunity to object has to be offered. These requirements may be challenging to fulfil in research conducted with disadvantaged or marginalized groups (CIOMS 2016). Despite the fact that this registry was established in a high-income country, information was collected from staff providing maternity care across a range of health settings and would have very likely included data about women from disadvantaged and marginalized groups. Significantly, in Case 7.2, despite the fact that none of the women opted out of the research, fewer than 10 cases were added in the early months of the pandemic. Whether this was attributable to an effective national COVID-19 response or the reluctance of health providers to register women, it limits the social value of the research and the utility of the dataset. In the context of a public health emergency, researchers and ethics committees may need to give particular regard to whether maximizing the utility of a pregnancy outcomes registry may warrant a waiver of consent.

7.2.4 Privacy

Protecting the privacy and confidentiality of participants remains one of the main concerns of the storage, access to, management of and sharing of data in research (Bull et al. 2015a). Custodians of data are required to make arrangements to protect the confidentiality of the information linked to the data and to limit access to the material relating to third parties (CIOMS 2016). If there is a breach in confidentiality, participants may risk being blamed or stigmatized, and public trust in science and research may be undermined. In the context of COVID-19, for example, some COVID-19 patients have had to respond publicly to allegations about non-compliance with public health measures and defend themselves (Atan 2020). In such contexts people with symptoms of COVID-19 may not want to come forward and get tested. There are two important privacy-related issues that arise from the cases in this chapter: the issue of de-identification of data and the use of existing datasets.

7.2.5 Curation, De-identification and Anonymization

Careful curation and de-identification of data is frequently offered as a way of protecting individual anonymity, but some have argued that identifiability exists on a continuum (Rothstein 2010). Although personally identifying information, such as name, address and date of birth, are omitted when data are shared, the data may still have identifying characteristics. Low numbers of enrolled patients may also heighten the risks of identification, as noted in Case 7.2. In addition, data scientists have proved on multiple occasions that datasets that were thought to be anonymized could be linked with other public health data to identify the specific data subject (Sweeney 2000). With the advent of Big Data and an increasing move to link large databases and permit exploration with machine learning and artificial intelligence approaches, it may become increasingly difficult to ensure the anonymity of individuals, and researchers should be aware of the risks to both individuals and communities in the event of re-identification. This is a particular concern in relation to the sharing of health records, and the lack of consistency in the applications of anonymization remains an unresolved issue.

It is also important to bear in mind the possibility of harm to groups or communities in the form of stigma or discrimination. Although data are de-identified at the individual level before they are shared, the dataset might still be attributable to a certain community, and risks of stigmatization or discrimination should be taken into account. This could impact employment opportunities or lead to discrimination by insurance companies (Rothstein 2010).

Sharing qualitative data poses an additional set of challenges. This is especially true when the data collected are derived from individuals who are easily identifiable by their profession, location, idiolect or opinions (such as in Case 7.3) and/or address intimate aspects of people’s lives and are considered very sensitive (as in Case 7.4). It is generally agreed that the more potentially identifying information is removed from a dataset, the less useful such data may be. The ethics committees in Cases 7.3 and 7.4 flag this very issue, and are concerned about preserving the confidentiality of participants.

Cases 7.27.4 raise concerns about possible re-identification of participants, and it is worth exploring who these participants are in each case and whether in the context of a pandemic, it may be ethical to proceed with research despite the possible risk of identification. Researchers and ethics committees will need to identify and balance the potential risks to the specific group of participants against the anticipated utility of the data in relation to answering important questions related to the public health emergency.

7.2.6 Use of Existing Datasets

Challenges arise when researchers seek to use archived data from prior research, clinical care or other public health activities without having obtained informed consent from participants for their future use. An example of this type of study is a review of old hospital records, where participants have not been asked if they consent to their data being used for research purposes in the future. Another example is the use of previously collected datasets generated for another purpose, as in Case 7.4, where the proposal was to use health data from earlier surveys to identify potential participants for proposed research. In such cases, the CIOMS 2016 guidelines state that “the research ethics committee may waive the requirement of individual informed consent if: (1) the research would not be feasible or practicable to carry out without the waiver; (2) the research has important social value; and (3) the research poses no more than minimal risks to participants or to the group to which the participant belongs” (CIOMS 2016). In Case 7.5, researchers were proposing to contact some participants in the survey to invite them to participate in a new study, but participants had not provided consent to be contacted. The proposed study could directly address their health needs. In the context of Case 7.5, it seems that criteria 1 and 2 may be met. As for criterion 3, researchers should find ways to minimize any risks to participants, including when contacting them to invite them to take part in a study. It is important to bear in mind that some of these risks, such as being stigmatized, may not be obvious to the researcher.

7.3 Promoting Equity

It is important that primary and secondary users of data ensure that approaches to data processing and use are equitable (GLOPID-R 2018). They should recognize and balance the needs of the different communities involved. This includes those who collect and generate the data, secondary users of the data, the individuals and communities from which the data originate, and the funders of the collection effort (Vayena and Blasimme 2017). Sharing data widely or allowing completely open access to data, with minimal governance mechanisms and oversight, has previously generated significant concerns related to equity, such as lack of recognition of the efforts of data generators, inequitable access to data, and failure to ensure fair benefits to study participants and communities, especially when dealing with data collected from potentially vulnerable populations (Pratt and Bull 2021).

Researchers working in low-resource settings have raised the issue that data-sharing has the potential to exacerbate existing inequalities between health researchers working in low-resource and high-resource settings (Serwadda et al. 2018). They worry that they are reduced to data collectors in the service of highly skilled data analysts and statisticians from high-resource settings. Some approaches that can be taken to prevent this from happening include sharing credit for scientific outputs, ensuring that capacity-building measures are built into research proposals, and ensuring that collaborations are negotiated on the basis of mutually beneficial data-sharing arrangements. Unfortunately, time and resources are often scarce during a pandemic, when the research imperative takes on a new urgency and some of these approaches may not be feasible. For instance, the national research ethics committee in Case 7.3 is described as unable to function optimally, owing, among other things, to lack of resources and inadequate training. A pandemic may not be an ideal time to focus on building the capacity of existing ethics committees when research needs to be carried out in a timely fashion. In such situations, it is suggested that other organizations may need to provide assistance to local research ethics committees to help them overcome these challenges (Smith and Upshur 2019).

Participants and communities involved in research have a valid interest in experiencing the benefits of research arising from the use of their data. In the context of low- and middle-income countries and vulnerable populations, particular attention should be paid to research projects that rely on technologies that are not accessible by or available to certain populations. Fair access to the benefits of research should require that individuals or communities are not excluded from the potential benefit of participating in research because of a digital divide. These concerns may be relevant to Case 7.1, which involves the use of a mobile application developed and deployed in Europe and the Americas. Even in high-income countries, researchers should be aware of the need to ensure that vulnerable and marginalized communities are not excluded from research that could confer benefits to them. Pandemics have historically affected disadvantaged communities very differently, with higher rates of infection, mortality and morbidity (Bambra et al. 2020; Osterrieder et al. 2021; Schneiders et al. 2022), and depriving these communities the benefits of research is unethical.

However, it is not always clear what would constitute a benefit and who it should be shared with. Stakeholders have discussed the importance of both direct and indirect benefits (Bull et al. 2015a). Indirect benefits are particularly relevant in the context of secondary research, which may not address health issues of relevance to participants and communities.

It is also worth noting that the sharing of data for commercial purposes may be a sensitive issue, particularly in relation to a public health emergency (Pratt and Bull 2021; Tangcharoensathien et al. 2010; Ghafur et al. 2020). Community expectations and views may also vary considerably, depending on historical, political and cultural contexts, and researchers should be mindful of the interests of communities when sharing data in such contexts. The involvement of commercial interests in research is also seen as potentially problematic, as they may inhibit timely sharing of data and results as well as being reluctant to share negative data and results. Data-sharing procedures should be agreed in advance to ensure timely access to data and results (including negative ones) (GLOPID-R 2018).

7.4 Conclusion

The collection, processing and sharing of individual-level health data are a critical part of public health emergency responses. Timely and effective access to and analysis of data in health research can generate a “deeper understanding of an outbreak, its impact on patients, and effective methods of control – supporting more effective public health responses” (GLOPID-R 2018). However, the collection, processing and sharing of data must be done in an ethically appropriate manner. The broad ethical principles of respecting persons and communities, and promoting equity, which apply to the use of health data in non-emergency research should also remain the foundational principles of data sharing during a public health emergency. However, during a pandemic there are salient considerations that should inform how these broad principles should be applied on a case-by-case basis, as demonstrated above.