The Royal Society of Chemistry believes that, where possible, all data associated with the research in a manuscript should be Findable, Accessible, Interoperable, and Reusable (FAIR), enabling other researchers to replicate and build on that research. We then expect that authors submit both their code and data to community-recognized data repositories or a general repository if no community resource is available. Referees have to be able to access code and data during the peer-review process, and public release should be coordinated with the publication of the manuscript.
If authors are unable to publicly share their code and/or data due to employer policies or previous IP arrangements, they still need to upload as much data and tools as possible (datasets, analysis scripts, etc.) and detail the reasons for the omission of other resources in the Data Availability Statement.
Data availability statements
In order to maintain high standards of transparency, research reproducibility, and to promote the reuse of new findings, Digital Discovery requires authors to submit a Data Availability Statement upon their article submission.
Data availability statements (DAS) provide information about where data, software or code supporting the results reported in a published article can be found. These should include, where applicable, links to code and datasets shared in an external data repository, which have been analysed or generated during the study. This section should list the database, accession number, DOI, URL or any other relevant details. In case of code deposition, the exact version and date of access to a repository have to be described for future reproducibility. If deposited in a public repository associated codes and datasets are required to be formally cited in the reference section of an article.
The DAS can provide information about the data presented in an article (e.g. in Figures or Tables) or provide a reason if code or data is not available to access. If supporting data or code have been included in the article’s electronic supplementary information (ESI), this should also be stated here.
A Data Availability Statement is required to be submitted alongside all manuscript submissions as an accompanying file. It will be published as part of the articles accepted for publication and can be found by readers following the Conclusions of the paper. Examples of Data Availability Statements which can be used are shown below:
1. The code for [description] can be found at [URL] with [DOI – if available]. The version of the code employed for this study is version XXXX.
2. Data and processing scripts for this paper, including [description of data types] are available at [name of repository] at [URL – format https://doi.org/DOI].
3. This study was carried out using publicly available data from [name of repository] at [URL] with [accession number].
4. Alternatively: The code, analysis scripts, and datasets supporting this article have been uploaded as part of the supplementary material.
5. The data analysis scripts of this paper are available in the interactive notebook X [Example, Google Collab].
6. As this is a [Review/Perspective article], no primary research results, data, software or code have been included.
The following statement is generally not acceptable: “Data are available upon request from the authors".
Choosing a repository
The Royal Society of Chemistry supports the TRUST principles for digital repositories - Transparency, Responsibility, User focus, Sustainability and Technology - for repository selection and strongly encourages the use of repositories offering persistent identifiers, such as DOIs, for deposited datasets. These help make robust connections between datasets and papers, e.g. via the inclusion of a Data Availability Statement.
The following websites may help for searching and selecting subject specific repositories.
Software / code can be deposited in the repositories listed below:
- GitHub Please also consider archiving code in combination with a repository that can issue a DOI. For example: https://guides.github.com/activities/citable-code/
- Code Ocean
Where subject specific or institutional/funder repositories are not available, authors may wish to choose a general repository, such as:
Dryad Digital Repository - Fees apply
figshare - Fees apply
Harvard Dataverse - Contact repository for datasets over 1 TB
Open Science Framework - Free of charge
Science Data Bank - Free of charge
Zenodo - Donations towards sustainability encouraged