Intended for healthcare professionals
Practice

Responsible governance of genomics data and biospecimens in the context of broad consent: experiences of a pioneering access committee in Africa

Abstract

International collaboration in genomic research is gaining momentum in African countries and is often supported by external funding. Over the last decade, there has been an increased interest in African genomic data. The contribution of this rich data resource in understanding diseases predominant in both African and global populations has been limited to date. There has been some non-governmental funding dedicated to the advancement of genomic research and innovation by African-based and African-led research groups, but the impact of these initiatives is hard to quantify. However, there is now an opportunity for the global research community to leverage decades of genomic data and biospecimens originating from African populations. The experience we describe in this paper is of an access governance framework established under the Human, Heredity, and Health in Africa (H3A) consortium, given the task of managing wider access to the data and biospecimen resources collected via its various projects. The function of the Data and Biospecimen Access Committee (DBAC) is to facilitate the advancement of medicine and health while fostering the development of bioinformatics capabilities at Africa-based institutions or regional hubs. Our collective experiences and lessons learnt as a committee provide examples of nuanced considerations when evaluating access to African data. The committee was semi-autonomous in its establishment and had independence in decision-making. The DBAC continually advocates for the responsible use of genomic data and biospecimens that were obtained from African research participants, under broad consent, by primary researchers who no longer have oversight over the future use of these resources.

Summary box

  • International collaboration in genomic research is gaining momentum in African countries, often supported by external funding.

  • Over the last decade, interest in African genomic data generated under internationally funded projects project has increased, One of these projects, the Human, Heredity, and Health in Africa (H3A), established a Data and Biospecimen Access Committee (DBAC) to manage access to such resources.

  • In this paper we share the 5-years experience of the H3A- DBAC in promoting responsible governance and use of genomic data, fostering genomics and bioinformatics capacity-building in Africa, and ensuring return of benefit to participants.

Introduction

As Africa steps into the realm of genetic characterisation and the resultant generation of whole exome or genome sequence data for thousands of individuals, facilitating responsible access to genomic data is an ethical and scientific imperative. Generating genomic data from African populations has a significant benefit to the broader scientific community due to the extensive genetic diversity observed in African genomes. While it is in the best interests of Africans and their health that genomic data be accessible to researchers and healthcare professionals across the global community, access needs to be both facilitated and controlled.

Access to genomics data can be regulated via three possible approaches:

  • Unrestricted or public access: data are completely open and accessible for consultation and public download. This kind of access does not allow traceability and monitoring of the intended use of the data.

  • Registered access: requires the registration of the user to the repository before being able to download the data. This guarantees traceability and monitoring of the use of the data.

  • Controlled access: is the most stringent data access approach and requires credentialed users to apply for access to data through an intermediary body, which evaluates requests and allows access if a request complies with stated prerequisites.

For data repositories, there is not a ‘one-size-fits-all’ approach, and it is possible to have any mixture of the three types of access, where some datasets/variables are open and some require simple registration, while others may need a data access request. In determining which of these approaches are suitable for the African context considerations such as ease of access to data/samples, subsequent benefit sharing and vulnerability of relevant population groups should be treated as major issues associated with data and sample sharing. There is a need for governance to ensure appropriate access and use of data, to avoid introducing data errors into systems and to obviate potential misuse of personal and sensitive information.1

In controlled access, data access requests (DARs) are typically evaluated by a Data Access Committee (DAC). A DAC is a body composed of individuals with diverse expertise, removed from the processes of obtaining biological samples and genomic data generation. The Council for International Organisations of Medical Sciences guidelines2 state that DACs should have ‘representation from the original setting’ with diverse expertise that covers data management, ethics, analytics relevant research areas and a patient or data-sharing advocate.

DACs are responsible for data release to external requestors based on guidelines and policies that have been defined by the parties that collected the samples, produced the data and funded the process while respecting the regulations of the country or continental region from which the samples originated. Some authors have recommended that DACs should be established within institutional and legal frameworks with clear lines of accountability, terms of reference and membership.3 DACs assess the ethical soundness and scientific feasibility of the DARs and evaluate the qualification of applicants to ensure they are bona fide researchers. DACs can be either independent of the institution that produced the data or institutional, each scenario presenting advantages and disadvantages.4 It has been recognised that DACs play important roles in both the promotion of data sharing and the protection of all stakeholders (data subjects, communities, data producers, institutions where researchers are based, the country or even the continent of affiliation) by encouraging secondary uses of data.

In this paper, we share the experience of the Human Heredity and Health for Africa (H3Africa) DBAC in assessing and evaluating DARs for the genomics data that have been generated under the H3Africa project. We discuss the ways in which the work of this (or similar) DBACs can be improved to achieve the goals of simultaneous stimulation of genomics data production in Africa and the subsequent responsible wide sharing of these data.

H3Africa DBAC

H3Africa project

African genomics data have been very scarce to date and represent <2% of all genomics data available worldwide.5 6 To fill this gap, the H3Africa programme was launched in 2012 with funds from the NIH (USA) and the Wellcome Foundation (UK) in partnership with the African Society of Human Genetics. The H3Africa consortium facilitated fundamental research on genetic determinants of diseases on the African continent by developing human capacity, infrastructure (mainly for sequencing, biobanking and bioinformatics), resources and ethical guidelines, by funding 51 projects led by African scientists within the continent. Projects included population-based genomic studies of common non-communicable disorders, as well as infectious diseases. The programme ended in 2022 and resulted in the release of 23 datasets (table 1) and the collection of 23 421 biospecimens from 16 studies.7

Table 1
Summary of the datasets available in the H3Africa catalogue, accessible via the European Genome Archive (EGA) following the request to the H3A-DBAC (EGA identifier EGAC00001000648). Included is information on how many times the respective dataset has been requested, plus the countries from which the request originated

The H3Africa DBAC

All genomic and phenotypic research data generated by the H3Africa projects were securely stored at the H3Africa Data Archive, under the H3ABioNet, a Pan African bioinformatics network for the consortium, and then submitted to the European Genome-Phenome Archive (EGA), an upfront requirement stipulated by the funders. The decision to use EGA was based on the lack of suitable infrastructure on the continent for data storage, and the long history and expertise of the EGA with respect to handling this type and amount of data. Data access is controlled by the H3Africa Data and Biospecimen Access Committee (referred to as H3A-DBAC), which is in accordance with the EGA guidelines that allow any organisation, project or funder that deposits data to create their own DAC for controlled access (figure 1). Currently, 2432 active DACs operate under the EGA.8

Data and Biospecimen Access committee request processing flowchart.

The H3A-DBAC was formed in 2016, at the request of the H3A steering committee. The guidelines under which the DBAC should operate and considerations for data access were developed by a subcommittee of the consortium, the DBAC Working Group. There was subsequent input from the DBAC members towards refining certain guidelines for evaluating DARs. These guidelines are available in.9

The H3A-DBAC is currently composed of nine members from across the continent, with expertise in different fields: biobanking, data, genetics and genomics, ethics, law, patient advocacy, demography and epidemiology. The mandate of the committee is to ‘evaluate whether the requests conform to H3Africa policies and procedures including consistency of the proposed research use with the informed consent under which the data or biospecimen were collected and any other limitations stipulated by the submitting H3Africa investigators for each study’. While guidelines were initially stipulated by the steering committee, the DBAC acts independently as it adjudicates on requests. The committee also provides continued feedback to the DBAC-Working Group on aspects that need to be reconsidered, as they learn from their activities.

Challenges faced during the set-up of the H3A-DBAC

The H3A-DBAC is, to the best of our knowledge, the first DAC that was set up to regulate controlled access to African genomics data produced on the continent. The context of data and biospecimen sharing in Africa includes difficulties associated with identifying appropriate participants, consenting them properly and the preparation and storage of samples. In addition, processing samples, particularly genome characterisation, is costly, and the subsequent analysis comes with its own challenges. Thus, any sample and data-sharing governance would need to address these realities.

The key issues that required significant discussion were

  1. The requirement for the data requestor to return benefits to African populations from the intended secondary use of the data.

  2. The requirement for the data requestor to demonstrate evidence of a collaboration with African scientist(s), which was stipulated as being mandatory for biospecimen requests and recommended for data requests.9 The DBAC rationalised that this would promote capacity building on the continent through collaborative partnership.

  3. Access to data by commercial/non-profit organisations.

H3A DBAC: the experience of the first 5 years

The first DAR was received by the DBAC in December 2018. At the end of June 2023, a total of 28 DARs have been received and evaluated by the DBAC (figure 2a).

Number of requests received per year (a) and by geographic origin (b).

Origin of the requests

Seven DARs (25%) were from private companies, six from Africa, while 13 were from USA-based organisations (figure 2b).

Time taken to the first feedback on the DARs

When requests were made, their first contact is the H3A-DBAC secretariat, which quality checked the completeness of supporting documentation, after which the requests were forwarded to DBAC members for evaluation and decision. At the outset, the DBAC met quarterly, but this was changed since end of 2022 to monthly meetings to facilitate quicker turnaround times. The average time taken by the DBAC to examine the DARs and issue a first reply was 69 (20–210) days with a median of 57 days.

Number of datasets requested

Most of the DARs (19/28, 67.8%) requested three datasets or more, while two requested two datasets and seven requested a single dataset. Among these, four requested datasets relative to whole genome sequences, underlining the importance of such (scarce) data on African populations.

Decisions on the requests

The first decision of the DBAC for any request would be:

  • Approval: three requests were approved on the first review.

  • Conditional approval: this normally resulted from requests missing information that is required or DBAC members seeking clarification of some issues. The requester would be asked to either make a revision or supply the missing information and documents. The DBAC then carried out a second review to ascertain if concerns were addressed. Sixteen out of the 28 requests had been finally approved as of 30 June 2023. Most of the requests (18) received conditional approval as the first feedback, while seven were asked for major revision.

  • Rejection: a single DAR received an outright rejection.

From the requests that received a first decision of conditional approval or major revision, we examined the first decision letter sent by the DBAC secretariat based on the recommendations of DBAC members’ individual reviews, and the main reasons for the decision were noted. Four major categories of reasons for only conditional approval were noted: (1) research question and/or methodology needed clarification, (2) ethics certificates were not provided or needed updating, (3) benefits to Africa were not discussed by the applicant or were deemed insufficient or inadequate by the DBAC and (4) collaboration with African scientists was not evidenced. According to the DAR application form (section 6), the requester should justify the benefits to Africa and particularly how the proposed use of data and specimens could improve the health of the African population and contribute to the development of expertise and research capacity among African scientists. The DBAC9 guidelines also specify that the DBAC should check that the requester collaborated with an African scientist, although this is not considered a mandatory requirement.

Among the 25 requests that received the first feedback with conditional approval or revision, nine (36%) were asked to provide a clearer description of the research question and/or the methodology for data use and analyses. Half (50%) of the requests required clarification around ethical issues, with requirements ranging from providing a valid ethical clearance certificate from their host institution to clarifying the risk of stigmatisation or harm to African populations who provided the samples. Benefits to Africa were questioned in almost 40% of the DARs, while the lack of a clear African collaboration was raised in 61% of cases; these two issues often arose together (in 11 cases out of 25). All four issues were noted in 2/25 requests (and received a major revision decision). It is worth noting that the ethical issue comes as a single comment in six of the requests (figure 3).

A Venn diagram showing the categories of issues raised in the first feedback letter of DBAC to requesters (see text for further details).

DAR from private institutions required clarification on ‘Methodology’ significantly more frequently than DARs coming from the academic sector (71.4% vs 20%, respectively, Fisher’s Exact Test (FET) p value=0.023). The same trend is seen for the collaboration issue, with 100% of DARs from the private sector needing further input on this requirement, as opposed to 50% of DARs from the academic sector (FET p value=0.026). Conversely, no significant difference was seen for the Ethics and the Benefits issues (FET p=0.67 and p=0.084, respectively).

Discussion

Origin of requests

It is not surprising that most DARs came from the USA and UK, given that the UK and the USA funded the H3Africa project, and these two countries are active leaders in human genetics research. Moreover, most African PIs have a history of research collaboration with scientists in these two countries, and some of the work of H3A data generation (sequencing and genotyping) was done in collaboration with scientists from these countries.10

DARs reflect an awareness of the H3A African data, and its unique ability to add value to a diverse range of research applications, particularly the establishment of allele frequencies in various normal populations and its potential utility in resolving the pathogenicity of variants. It is hoped that with increasing publications from the secondary use, this will further promote and advertise the existence of the African data beyond only the USA and the UK. The few requests recently received from countries such as Japan, India and Brazil might indicate that publications from the H3Africa consortia are being noticed by a wider readership, particularly those appearing in high impact journals.10–12 The low number of requests from Africa itself might reflect the lack of computational capacity and/or expertise on the African continent. It is also possible that African scientists involved in data generation within the H3A consortium might have informally shared these data with their African collaborators directly.

Issues raised by DBAC about DARs

It is interesting to note that the nine DARs that received comments on three issues or more are equally distributed between the private and public sectors, indicating that the DBAC examined the DARs with similar criteria independently of their origin. However, it is significant to note that the methodology and the collaboration issue have been more frequently raised for DARs from the private sector. The methodology issue is likely explained by the fact that most DARs from the private sector have not provided clear description of the research question and/or the data analyses methods to be used, either intentionally, so as not to reveal some of the tools they have developed or the information they are looking for, or simply because of lack of awareness of the importance of this issue.

The collaboration issue was raised for all DARs from the private sector and 50% of DARs from the academic sector. This reflects the DBAC’s priority with respect to stimulating and ensuring collaboration with African scientists who produced these data and at least a minimal level of return of benefit to Africa. Although the collaboration with African scientists is not a mandatory requirement to have access to data, the DBAC have given this issue high importance by categorising collaboration with African scientists into six groups. If none of them exists, requesters are encouraged (in the feedback letter) to initiate collaboration. These six levels are (1) new or already existing collaboration (officially funded or not) with one African scientist as a Co-PI or collaborator; if the collaboration is ‘non-official’, a letter from the African scientist is acceptable evidence (2) common publications, (3) evidence of regular meeting or exchange, (4) training of African scientists within the requester’s lab or institution, (5) lectureship or a visiting fellowship of the requester to an African institution and (6) contribution to the infrastructure or capacity building in Africa.

It was noted that most of the requests seemed to overlook the potential group harm depending on how data were intended to be used and how findings were to be disseminated. Requesters were required to ensure responsible and sensitive dissemination in ways that did not stigmatise or spotlight certain groups. There is a history of exploitative research in Africa, and oversight bodies need to protect vulnerable populations.13 14 At the same time, research efforts which benefit these same populations should not be hampered by overly paternalistic committees, and equitable data sharing must be encouraged and facilitated. The WHO has set up the Technical Advisory Group on Genomics15 who, among other objectives, will shortly publish a document outlining guiding principles of human genome data sharing.

Conclusion

The H3Africa DBAC has played a role in ensuring that African genomes are responsibly interrogated for the benefit of the human population. The experience of this DBAC provides a template on which new projects, or consortia, can be built. In addition, the experience of this DBAC is now available to other upcoming initiatives and can provide advice or support, so that there is coordinated access to African data that are beneficial to the broader scientific community. It is hoped that increased availability of African genomics data, and its access to global researchers and companies, could benefit not only Africans, but humanity at large.

  • Handling editor: Helen J Surana

  • X: @codan

  • Contributors: AR drafted the manuscript, analysed the data and produced table and figures 2 and 3. RW collected the data. KH produced figure 1. AA, PA, DB, KH, RK, JM and CD and AR conceptualised and edited the manuscript, revised it critically for important intellectual content and made substantial contribution to the concept or design of the article or acquied, analysed or interpreted data for the article. All authors reviewed the results, approved the final version of the manuscript and agreed to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. AR is the guarantor for the work.

  • Funding: the H3Africa DBAC is an independent data governance body that has been funded by Wellcome foundation under the H3Africa project (Human Heredity and Health in Africa (H3Africa) Initiative (Phase II). Grant Number: 205165/B/16/Z.

  • Competing interests: None declared.

  • Provenance and peer review: Not commissioned; externally peer reviewed.

Data availability statement

All data relevant to the study are included in the article or uploaded as supplementary information. All data relevant to the study are included in the article.

Ethics statements

Patient consent for publication:
Ethics approval:

Not applicable.

Acknowledgements

The authors would like to thank Tumani Corrah and Alliance Nikuze as former members of the H3Africa Data and Biospecimen Access Committee (DBAC) for their contribution regarding the work that allowed establishment of DBAC activities, and Christine Wasuna, Ayaga Bawah, Cornelius Debpuur and Phyliss Babirye as current members of the DBAC for their comments on the paper. We are also grateful to the H3Africa steering committee for the rich discussion, guidance and interactions during and after the setup of the DBAC. We would particularly like to thank Michelle Skelton for her continued interaction with the DBAC. The H3Africa DBAC has been funded by the H3Africa consortium and sustained by the Science for Africa Foundation.

  1. close Chandramohan D, Shibuya K, Setel P, et al. Should data from demographic surveillance systems be made more widely available to researchers? PLoS Med 2008; 5.
  2. close Shabani M, Thorogood A, Borry P, et al. Who should have access to genomic data and how should they be held accountable? Perspectives of Data Access Committee members and experts. Eur J Hum Genet 2016; 24:1671–5.
  3. close Cheah PY, Piasecki J. Data Access Committees. BMC Med Ethics 2020; 21.
  4. close The H3Africa Consortium. Enabling the genomic revolution in Africa. Science 2014; 344:1346–8.
  5. close Wonkam A. Sequence three million genomes across Africa. Nature New Biol 2021; 590:209–11.
  6. close Choudhury A, Aron S, Botigué LR, et al. High-depth African genomes inform human migration and health. Nature New Biol 2020; 586:741–8.
  7. close Fortes-Lima CA, Burgarella C, Hammarén R, et al. The genetic legacy of the expansion of Bantu-speaking peoples in Africa. Nature New Biol 2024; 625:540–7.
  8. close Chennells R, Steenkamp A. International genomics research involving the san people, Ethics dumping case studies from North- South Research Collaborations. Springer Briefs in Research and Innovation Governance 2018;
  9. close Moodley K, Cengiz N, Domingo A, et al. Ethics and governance challenges related to genomic data sharing in southern Africa: the case of SARS-CoV-2. Lancet Glob Health 2022; 10:e1855–9.

Crossmark logoCrossmark logo
  • Received: 23 April 2024
  • Accepted: 19 December 2024
  • First published: 8 February 2025

Altmetric

Altmetric scores reflect online attention articles receive, shown as a central number with coloured threads for each source

Dimensions

Dimensions shows where and when an article has been cited, helping readers understand its academic influence over time