Appendix: Finding Open Data

The reusability of openly shared data relies on the prospects of it being found in the first place, therefore data findability is a key step in accessing and utilizing data. There are three major ways to find Open Data that are shared by researchers – repository, web search, and literature search.

Repositories

Ideally, Open Data should be available in repositories where the datasets are properly indexed and assigned a unique persistent identifier (as discussed in Lesson 6 – Sharing Open Data) thereby ensuring the data is unambiguously identifiable, searchable, discoverable along with associated metadata and documentations.

Therefore, the first step in finding Open Data related to your field is to identify discipline specific repositories (if there are any) and search for datasets there (see Lesson 6.4 – Repositories and Other Sharing Methods).

Find repositories in your field:

  • Re3data.org is a global registry of research data repositories that covers research data repositories from different academic disciplines.
  • FAIRsharing is a curated, informative, and educational resource on data and metadata standards, inter-related to databases and data policies.
  • Recommended repositories by publishers (e.g., Recommended Data Repositories suggested by Scientific Data andPLOS One)
  • World Data System represents a network of repositories.

Examples of generic repositories:

TheGeneralist Repository Comparison Chart is a tool you can use to decide where to store and share their FAIR data outside of their institutional repositories. Dataverse has also published acomparative review of eight data repositories.

Web-searches

To explore a wide variety of datasets from projects or popular topics, the use of a more general search engine can be helpful. Some disciplines or large institutions such as NASA and the National Institute of Health’s National Center for Biotechnology Information (NCBI) offer their own portal where you can search for their datasets, related publications and oftentimes tools for analysis (e.g., EMBL’s European Bioinformatics Institutehttps://www.ebi.ac.uk/ ). There are also an increasing number of international and national data portals to enable data discoveries.

Generic data search portals:

Examples of Discipline specific:

Examples of National or international data portal