Data sharing improves the value, synthesis, and integrity of research, but rates are low. Data sharing might be improved if data sharing policies were prominent and actionable at every stage of research. We aimed to systematically describe the epidemiology of data sharing policies across the health research lifecycle.
This was a cross-sectional analysis of the data sharing policies of the largest health research funders, all national ethics committees, all clinical trial registries, the highest-impact medical journals, and all medical research data repositories. Stakeholders' official websites, online reports, and other records were reviewed up to May 2022. The strength and characteristics of their data sharing policies were assessed, including their policies on data sharing intention statements (a.k.a. data accessibility statements) and on data sharing specifically for coronavirus disease studies. Data were manually extracted in duplicate, and policies were descriptively analysed by their stakeholder and characteristics.
Nine hundred and thirty-five eligible stakeholders were identified: 110 funders, 124 ethics committees, 18 trial registries, 273 journals, and 410 data repositories. Data sharing was required by 41% (45/110) of funders, no ethics committees or trial registries, 19% (52/273) of journals and 6% (24/410) of data repositories. Among funder types, a higher proportion of private (63%, 35/55) and philanthropic (67%, 4/6) funders required data sharing than public funders (12%, 6/49).
Data sharing requirements, and even recommendations, were insufficient across health research. Where data sharing was required or recommended, there was limited guidance on implementation. We describe multiple pathways to improve the implementation of data sharing. Public funders and ethics committees are two stakeholders with particularly important untapped opportunities.