- Opinion Article
- Open Access
Including all voices in international data-sharing governance
Human Genomics volume 12, Article number: 13 (2018)
Governments, funding bodies, institutions, and publishers have developed a number of strategies to encourage researchers to facilitate access to datasets. The rationale behind this approach is that this will bring a number of benefits and enable advances in healthcare and medicine by allowing the maximum returns from the investment in research, as well as reducing waste and promoting transparency. As this approach gains momentum, these data-sharing practices have implications for many kinds of research as they become standard practice across the world.
The governance frameworks that have been developed to support biomedical research are not well equipped to deal with the complexities of international data sharing. This system is nationally based and is dependent upon expert committees for oversight and compliance, which has often led to piece-meal decision-making. This system tends to perpetuate inequalities by obscuring the contributions and the important role of different data providers along the data stream, whether they be low- or middle-income country researchers, patients, research participants, groups, or communities. As research and data-sharing activities are largely publicly funded, there is a strong moral argument for including the people who provide the data in decision-making and to develop governance systems for their continued participation.
We recommend that governance of science becomes more transparent, representative, and responsive to the voices of many constituencies by conducting public consultations about data-sharing addressing issues of access and use; including all data providers in decision-making about the use and sharing of data along the whole of the data stream; and using digital technologies to encourage accessibility, transparency, and accountability. We anticipate that this approach could enhance the legitimacy of the research process, generate insights that may otherwise be overlooked or ignored, and help to bring valuable perspectives into the decision-making around international data sharing.
Modern biomedical research has been accelerated by advances in digital technologies , enabling interdisciplinary collaborations to collect, analyze and share different kinds of data across the globe. To facilitate data sharing, governments, funding bodies, institutions, and publishers have developed data infrastructures, policies, and incentives. Global data sharing is anticipated by many commentators to boost progress in health and medicine, reduce research waste, increase research value, and promote research transparency . While most research data are generated by researchers, digital innovation has enabled patients and citizens to be involved in data collection and creation through the use of smart phones and devices, and involvement in citizen-science projects. The scope of data considered to be health-related has also expanded to include demographic, administrative, socio-economic, medical, and behavioral measures as well as genomics and multi-omics data which can be derived from a number of different sources.
While data sharing is strongly encouraged, concerns exist that the existing governance frameworks of data sharing are inappropriate to cope with the size, scope, complexity, and scale of data sharing that is required in modern health research [3,4,5]. Furthermore, we argue that these frameworks tend to perpetuate, or even exacerbate, many of the current inequalities in research and health by obscuring the important role of those who donate data, whether they be low- or middle-income country researchers, patients, research participants, groups, or communities . This paper will discuss some of the inadequacies of the existing governance systems for research and suggest an approach to bring the voices of data contributors into decision-making around data sharing.
Flaws of the current data-sharing governance frameworks
Current data-sharing governance frameworks are not attuned to the needs of multi-institutional or multi-jurisdictional health research . In most countries, governance systems primarily are tied to territorial jurisdictions; these systems were originally developed to protect personal data within a single jurisdiction. Efforts to combine or sequentially apply territorial governance structures across borders may lead to undue complexity and bureaucratic regulatory thickets resulting in inefficient data flows characterized by delays in the research progress [7, 8]. Applying local or territorial governance to broader areas overloads the structures to a point where they are either inappropriate or lose their moral legitimacy by failing to reflect the concerns, interests, and values of all of the data contributors involved. This means that the movement of data becomes a series of contractual transactions or transfers in which it is very difficult to know if data have been used according to the wishes of data donors and curators further upstream in the research process as the data move through global networks.
Such a system also has the tendency to perpetuate inequalities by obscuring the contributions of different stakeholders along the data stream. This is particularly problematic if it creates a proportionately greater disadvantage for people in resource-poor countries to contribute to, be recognized for, and benefit from science. It also does not recognize the connectivity between data providers , and may contribute to muting the voices of the data contributors as data move progressively away from them. This lack of connection between the data providers and those who use the data can lead to potentially unjust outcomes that are not in accordance with the expectations of those who are providing data. Ultimately most data on individuals requires consent to be obtained. If data are shared in ways that appear contrary to the interests of the subjects and donors of that data, this could lead to a loss of trust in the research process and to people withholding consent and refusing to participate in further research.
Institutional Review Boards  or Research Ethics Committees  are the key governance bodies for research overseeing the recruitment of participants into the initial study, when data are generated. These committees often are based locally and must make decisions on behalf of participants domiciled within their jurisdictions. In multi-institutional research, several such committees perform their reviews in a succeeding manner with the same goal of protecting the interests of local participants, researchers, and institutions. This can lead to differences in decision-making, not only just between national committees, but also committees within the same jurisdiction. Although there may be patients and participants sitting on these committees as representatives, decisions are made on behalf of research participants by experts. The end result of this piece-meal decision-making is that it may not be in accordance with what individuals, groups, and communities want in terms of international data sharing . While in many regions of the world such as sub-Saharan Africa, mechanisms have been developed to engage communities and obtain their consent before research implementation [12, 13], such mechanisms are not broadly adopted in western countries or other countries with advanced biomedical research. Generally, community perspectives and formal public consultations are not requirements in the deliberations and decision-making regarding international data sharing.
Proposed new data-sharing governance frameworks
Recently, platforms have been developed to support the use of scientific data and to address some of the challenges described above. Examples include the newly proposed European Open Science Cloud , the Global Alliance for Genomics and Health [15, 16], the Database of Genotypes and Phenotypes (dbGaP) , and the European Genome-phenome Archive . Within these managed access platforms, new forms of governance are applied that sit alongside the traditional oversight bodies of expert ethics committees. For instance, Data Access Committees have become a standard oversight body for the approval of access to research platform by researchers , with some committees undertaking responsibility for a number of studies such as in the case of METADAC . To facilitate access to data, new governance mechanisms such as ‘pop-up’ governance  have been developed by research consortia. In such projects, data access committees are established at a consortium level and provide greater control for the researchers who have collected the data, and, potentially, for the participants who donate their data as well.
Many of these initiatives have largely been instigated by researchers and funding bodies in high-resourced countries, supported by foundational documents such as the Fort Lauderdale Agreement of 2003  and the Toronto Statement of 2009 . These governance systems have varying levels of participation from different stakeholders, but overall, the data-sharing agenda usually is formulated by a handful of stakeholders such as funding agencies and research organizations, in which regional or national regulatory frameworks have been considered on a case-by-case basis. Generally, discussions on biomedical data sharing are conducted from the perspective of those leading the development of the current global infrastructure for health science research, namely, academia, pharmaceutical companies, clinical health IT systems, funders, insurers, and policy-makers. Although all of them would state that they are fundamentally concerned with health and the furtherance of the public good, the vested interests of this infrastructure (i.e., academic achievement, the profit motive, international political competition) may not align with the solutions identified above . This can not only lead to inequalities between researchers in low- and high-resourced countries , but also an agenda that does not necessarily embody the concerns of research participants.
To some extent, patient organizations have undertaken similar initiatives to create opportunities for both data generation and sharing. Examples are enabling research participants or patients to directly upload health information or share genetic variants , or requiring collaborating investigators to share data as a condition of receiving registry data or biobank samples. While this has become more of the norm in terms of rare diseases, in other areas, this is not necessarily the case. The more commonplace situation for global infrastructure is that the people from whom the data originate, i.e., the research participant, the patient receiving treatment, the family member providing blood, or the community volunteering information, are not routinely provided with the opportunity for involvement in the design and management of data-storage decisions, or sharing practices [27, 28].
An inclusive and global approach
There is a pressing and growing need for a data-sharing governance framework that is inclusive, global, and transparent. Such a framework should enable the involvement of all of those who have contributed the data and who have an interest in the health research carried out, while still accounting for variations in those interests across different stakeholders. Further, it would stretch beyond national borders, include the lens of responsible data use and accountability, and enable democracy in a pluralistic global community. The key challenge becomes, then, how to reform the national data-sharing governance structure and to scale it up to a global level to produce a structure that is attentive to stakeholders from all of the research constituencies (research users, public and participants, industry, policy-makers, and researchers themselves) from all of the territories that those stakeholders inhabit (not just the western research paradigm) . Such a perspective will encourage paradigm shifts in the way we manage and disseminate data, even in the face of entrenched systems . To achieve this, we need to orient the discussions about priorities around the individuals from whom the data are obtained and to include a broader range of constituencies in planning and oversight.
Currently, generation and access criteria for data, samples and other materials vary considerably between national and international repositories in terms of permission levels, review bodies and policy requirements . Ensuring global equity in the value of large scale data-sharing will require systems that are used in all countries and recognized by all users and producers of data. The same advances in technology that present the challenges described above can also be part of the solution, enabling a better representation of the various values and cultures of the data providers and the communities they represent. With regards to a technical infrastructure, developing a global governance system that is appropriate for sharing and enabling the appropriate flow of data will require interoperable systems across all countries that are secure, transparent and accountable , and accessibility by all contributors, producers, and users of data [32, 33]. While individual systems employed by discrete projects need not be uniform, central or common, they should be harmonized so as to be designed around a transparent set of features that make them compatible with those used elsewhere, and comprehendible to an external audience.
Such systems must enable local and regional and socio-cultural decisions to be made with regard to sharing that respect the participants, and still permit the available data to be shared. Enabled by advances in electronic communication, the basic governance mechanisms that will allow this are in the process of development. Examples are the ORCID identification initiative, building a registry of unique researcher identifiers that could be used as a passport for allowing access to datasets and legitimate uses of data ; ‘Dynamic Consent’, an online interface that enables ongoing communication between data volunteers (participants) and users (researchers), and allows consent preferences to be modified over time ; and ÉCOUTER, an online tool for engaging and analyzing stakeholder perspectives [36, 37]. Widespread adoption of these systems would promote integrity, accountability, legitimacy, and trust.
Importantly, data-sharing governance structures must include the general public in setting priorities and determining how data are used. If we assume that data sharing is a means to an end, and that end is to benefit all communities equally, then the gold standard of public consultation should be to begin in a place that considers all communities in their own context. This means avoiding hypothetical person-centered approaches that presume a universal ‘reasonable person’ standard: if we want to understand how data sharing should occur in a particular country or region or how research participants can or do contribute to data-sharing planning, we need to do so empirically using a collection of robust and recognized social science approaches [6, 12].
In practice, we have a long way to go to achieving this ambition, but it is a perspective that will provide a strong framework for considering new options and could be transformative in integrating the tensions between the collective and the individual. In some countries, recent initiatives by funders of healthcare research to engage members of their public in decision-making processes have been praised for helping to re-direct research questions towards the needs of patients . It is now widely accepted that research questions and research design benefit from public consultation, and the same approach should apply to the design of data-sharing mechanisms, particularly when drafting data management plans and governance plans for data infrastructure, rather than this being the sole responsibility of researchers. Securing the involvement of both scientific and participant data providers in the decision-making of consortia and data infrastructure is another mechanism to account for publicly funded research activities and ensuring that these voices are heard.
Good practices for data sharing must evolve towards an interoperable set of standards, permitting sharing across borders. Good practices must also ensure that all views are heard and taken into account when defining these standards. The governance of science will have to become more transparent, representative, and responsive to the voices of many constituencies, with all the prospects for discord, compromise, and delay that this implies. We argue that as research and data-sharing activities are largely publicly funded, there is a strong moral argument for including the people who provide the data in decision-making and to develop governance systems to enable their continued participation. We further argue creative engagement of individual, familial, and community data volunteers with all stakeholders across boundaries, whether disciplinary, institutional, or national, has the potential to generate insights that may otherwise be overlooked or ignored when global perspectives are less comprehensive. This creative engagement can also reduce research waste [39, 40] by removing duplication and avoiding avenues of exploration already known to be dead ends. We also argue that appropriate data sharing can, and does, permit significant advances in our understanding of many complex health-related issues, helping to address currently unmet health needs.
The question is whether we have the political will to develop and facilitate these good data sharing practices. We have the technology that can enable us to do this now, in system examples such as ORCID; ‘Dynamic Consent’; and ÉCOUTER. Funding agencies and research organizations, which recognize the potential efficiency gains from data sharing and are critical in the formulation of the data-sharing agenda should review and revise their data sharing policies to involve participants .
When pursued from the perspective of the donors of information, and structured for their benefit, collective exploitation of data between stakeholders and across communities will create the possibility of improving efficiency and more importantly, of addressing inequalities.
Borry P, Bentzen HB, Budin-Ljøsne I, Cornel MC, Howard HC, Feeney O, et al. The challenges of the expanded availability of genomic information: an agenda-setting paper. J Community Genet. 2017; Available from: http://link.springer.com/10.1007/s12687-017-0331-7.
Rowhani-Farid A, Allen M, Barnett AG. What incentives increase data sharing in health and medical research? A systematic review. Res Integr Peer Rev. 2017;2:4.
Field D, Sansone S-A, Collis A, Booth T, Dukes P, Gregurick SK, et al. Omics data sharing. Science. 2009;326:234–6.
Murtagh MJ, Turner A, Minion JT, Fay M, Burton PR. International data sharing in practice: new technologies meet old governance. Biopreservation Biobanking. 2016;14:231–40.
Joly Y, Dove ES, Kennedy KL, Bobrow M, Ouellette BFF, Dyke SOM, et al. Open science and community norms: data retention and publication moratoria policies in genomics projects. Med Law Int. 2012;12:92–120.
Avard D, Bucci L, Burgess M, Kaye J, Heeney C, Farmer Y, et al. Public health genomics (phg) and public participation: points to consider. J Public Deliberation. 2009;5 Available from: https://www.publicdeliberation.net/jpd/vol5/iss1/art7/. Accessed 28 Feb 2018.
Kaye J, Heeney C, Hawkins N, De Vries J, Boddington P. Data sharing in genomics–re-shaping scientific practice. Nat Rev Genet. 2009;10:331.
Kaye J, Hawkins N. Data sharing policy design for consortia: challenges for sustainability. Genome Med. 2014;6:4.
Coicaud J-M, Tahri I. Nationally based data: challenges for global governance (and global policy). Glob Policy. 2014;5:135–45.
Goldenberg AJ, Maschke KJ, Joffe S, Botkin JR, Rothwell E, Murray TH, et al. IRB practices and policies regarding the secondary research use of biospecimens. BMC Med. Ethics. 2015;16:32.
Parker M. Ethical considerations related to mobile technology use in medical research. J Mob Technol Med. 2012;1:50–2.
de Vries J, SJ B, Doumbo O, Ibrahim M, Mercereau-Puijalon O, Kwiatkowski D, et al. Ethical issues in human genomics research in developing countries. BMC Med. Ethics. 2011;12:5.
de Vries J, Munung SN, Matimba A, McCurdy S, Ouwe Missi Oukem-Boyer O, Staunton C, et al. Regulation of genomic and biobanking research in Africa: a content analysis of ethics guidelines, policies and procedures from 22 African countries. BMC Med. Ethics. 2017;18:8.
European Open Science Cloud | Open Science-Research and Innovation-European Commission. Available from: https://ec.europa.eu/research/openscience/index.cfm?pg=open-science-cloud. Accessed 28 Feb 2018.
Editorial. The ups and downs of data sharing in science. Nat News. 2016;534:435–6. https://doi.org/10.1038/534435b.
Global Alliance for Genomics and Health. A federated ecosystem for sharing genomic, clinical data. Science. 2016;352:1278–80.
dbGaP - NCBI. Available from: https://www.ncbi.nlm.nih.gov/gap. Accessed 28 Feb 2018.
EMBL-EBI. Available from: https://www.ebi.ac.uk/ega/. Accessed 28 Feb 2018.
Shabani M, Dyke SOM, Joly Y, Borry P. Controlled access under review: improving the governance of genomic data access. PLoS Biol. 2015;13:e1002339.
metadac. Available from: https://www.metadac.ac.uk/. Accessed 28 Feb 2018.
Kaye J, Muddyman D, Smee C, Kennedy K, Bell J, UK10K. “Pop-up” governance: developing internal governance frameworks for consortia: the example of UK10K. Life Sci Soc Policy. 2015;11:10.
The Wellcome Trust. Sharing data from large-scale biological research projects: a system of tripartite responsibility [internet]. Fort Lauderdale; 2003. Available from: https://www.genome.gov/pages/research/wellcomereport0303.pdf. Accessed 28 Feb 2018.
Toronto International Data Release Workshop Authors, Birney E, Hudson TJ, Green ED, Gunter C, Eddy S, et al. Prepublication data sharing. Nature. 2009;461:168–70.
Macleod MR, Michie S, Roberts I, Dirnagl U, Chalmers I, Ioannidis JPA, et al. Biomedical research: increasing value, reducing waste. Lancet. 2014;383:101–4.
de Vries J, Tindana P, Littler K, Ramsay M, Rotimi C, Abayomi A, et al. The H3Africa policy framework: negotiating fairness in genomics. Trends Genet TIG. 2015;31:117–9.
Swan M. Emerging patient-driven health care models: an examination of health social networks, consumer personalized medicine and quantified self-tracking. Int J Environ Res Public Health. 2009;6:492–525.
Upshur RE, Lavery JV, Tindana PO. Taking tissue seriously means taking communities seriously. BMC Med. Ethics. 2007;8:11.
Haug CJ. Whose data are they anyway? Can a patient perspective advance the data-sharing debate? N Engl J Med. 2017;376:2203–5.
Yoshizawa G, Ho CW-L, Zhu W, Hu C, Syukriani Y, Lee I, et al. ELSI practices in genomic research in East Asia: implications for research collaboration and public participation. Genome Med. 2014;6:39.
Mort M, Mary Roberts C, Furbo MK, Wilkinson J, Mackenzie A. Biosensing: how citizens’ views illuminate emerging health and social risks. Health Risk Soc. 2015;17:605–23.
Kaye J, Briceño Moraia L, Mitchell C, Bell J, Bovenberg JA, Tassé A-M, et al. Access governance for biobanks: the case of the bioshare-eu cohorts. Biopreservation Biobanking. 2016;14:201–6.
De Castro P, Calzolari A, Napolitani F, Maria Rossi A, Mabile L, Cambon-Thomsen A, et al. Open data sharing in the context of bioresources. Acta Inform Medica. 2013;21:291–2.
Mabile L, Dalgleish R, Thorisson GA, Deschênes M, Hewitt R, Carpenter J, et al. Quantifying the use of bioresources for promoting their sharing in scientific research. GigaScience. 2013;2:7.
Haak LL, Fenner M, Paglione L, Pentz E, Ratner H. ORCID: a system to uniquely identify researchers. Learn Publ. 2012;25:259–64.
Kaye J, Whitley EA, Lund D, Morrison M, Teare H, Melham K. Dynamic consent: a patient interface for twenty-first century research networks. Eur J Hum Genet EJHG. 2015;23:141–6.
Murtagh MJ, Minion JT, Turner A, Wilson RC, Blell M, Ochieng C, et al. The ECOUTER methodology for stakeholder engagement in translational research. BMC Med Ethics. 2017;18:24.
Wilson RC, Butters OW, Clark T, Minion J, Turner A, Murtagh MJ. Digital methodology to implement the ECOUTER engagement process. F1000Research. 2016;5:1307. https://doi.org/10.12688/f1000research.8786.1.
Pizzo E, Doyle C, Matthews R, Barlow J. Patient and public involvement: how much do we spend and what are the benefits? Health Expect. 2015;18:1918–26.
Chalmers I, Bracken MB, Djulbegovic B, Garattini S, Grant J, Gülmezoglu AM, et al. How to increase value and reduce waste when research priorities are set. Lancet. 2014;383:156–65.
Chan A-W, Song F, Vickers A, Jefferson T, Dickersin K, Gøtzsche PC, et al. Increasing value and reducing waste: addressing inaccessible research. Lancet. 2014;383:257–66.
Examples NIH Data Sharing Policy- NIH Grants https://grants.nih.gov/grants/policy/data_sharing/. MRC Data Sharing Policy at https://www.mrc.ac.uk/research/policies-and-guidance-for-researchers/data-sharing/. Accessed 28 Feb 2018.
The authors would like to thank the Brocher Fondation, Route d’Hermance 471, CP 21 CH-1248 Hermance, Switzerland for hosting the workshop, and we thank Dr. Megan Prictor of Melbourne Law School, The University of Melbourne, Australia for her assistance in preparing this paper for publication.
The workshop that lead to this publication was generously supported by the Brocher Fondation, Route d’Hermance 471, CP 21 CH-1248 Hermance, Switzerland Grant 2013 “The Governance of International Consortia – Establishing A Proof of Principle for ELSI 2.0,” awarded to Jane Kaye, Bartha Knoppers, and Anne Cambon-Thomsen. Bartha Knoppers was unable to attend the final workshop.
Jane Kaye: Wellcome Trust Award 096599/2/11/Z; Innovative Medicines Initiative Joint Undertaking under grant agreement number 115317 (DIRECT), resources of which are composed of financial contribution from the European Union’s Seventh Framework Programme (FP7/2007–2013) and EFPIA companies’ in kind contribution.
Eric Juengst: UNC Center for Genomics and Society, NIH grant #P50HG004488.
Isabelle Budin-Ljøsne: Biobank Norway funded by the Research Council of Norway—grant number 245464.
Clement Adebayo Adebamowo: NIH grant # 1U01HG007654 and R25TW01514.
Louise Bezuidenhout: Leverhulme Trust (Beyond the Digital Divide (RPG-2013-153).
Emilomo Ogbe: University of Gent BOF startkrediet (BOF.STA.2016.0031.01).
Jusaku Minari: This research is partially supported by RISTEX, Japan Science and Technology Agency (JST), JSPS Grant-in-Aid for Young Scientists (B) (15K19149) and JSPS Challenging Research (Exploratory, 17K19812).
Kazuto Kato: Grant-in-Aid for Scientific Research (16H06279, 17K19812) by JSPS and the LeadER research program on ELSI of genomic medicine by the AMED (The Japan Agency for Medical Research and Development), Japan.
Sarah Coy and Michael Morrison: Innovative Medicines Initiative Joint Undertaking under grant agreement number 115439 (StemBANCC), resources of which are composed of financial contribution from the European Union’s Seventh Framework Programme (FP7/2007–2013) and European Federation of Pharmaceutical Industries and Associations (EFPIA) companies’ in kind contribution.
Harriet Teare: Innovative Medicines Initiative Joint Undertaking under grant agreement number 115317 (DIRECT), resources of which are composed of financial contribution from the European Union’s Seventh Framework Programme (FP7/2007–2013) and EFPIA companies’ in kind contribution.
Availability of data and materials
Ethics approval and consent to participate
No ethics approval or informed consent was necessary. Participants at the workshop were informed that the results from the workshop would be used to write a paper and were invited to contribute to its production.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.