From the NIH: Sharing, Discovering, and Citing COVID-19 Data and Code

COVID-19 data and code resources in generalist repositories

On April 24, the NIH Office of Data Science Strategy and National Library of Medicine hosted a webinar on the Sharing, Discovering, and Citing COVID-19 Data and Code in Generalist Repositories. This webinar featured valuable information from Vivli, figshare, GitHub, Dryad, Zenodo, Harvard Dataverse, and Mendeley Data.

Recording of the Webinar and presentations from each repository.

Here are some features for each of these platforms:

Vivli COVID-19 Portal

  • Vivli promotes, coordinates, and facilitates scientific sharing and reuse of clinical research data
  • Vivli COVID-19 portal provides a dedicated search function and availability of fast-tracked review and sharing
  • Waiver of all fees to share, archive, access, and analyze COVID-19 trials
  • Waiver of anonymization fees through key Vivli partners
  • How to share your clinical research data on the Vivli COVID-19 Portal
     

COVID-19 Data in Harvard Dataverse

  • Dataverse is an open-source software with a growing, active community
  • Harvard Dataverse is one of the 56 Dataverse repositories world-wide
  • 50 COVID-19 datasets with 2,043 data files deposited since February 7
  • Total of 48,471 downloads by April 23
  • Include datasets on: COVID-19 statistics, social science studies to evaluate the effectiveness of government measures, survey data, Twitter data, gubernatorial responses, replication data for preprints
  • COVID-19 Data Collection
  • Resources for COVID-19
     

PubMed Central and COVID-19

  • Leveraging the literature as a dataset during a pandemic
  • Call for publishers to agree to make their COVID-19 and coronavirus-related publications, and the available data supporting them, immediately accessible in PubMed Central and other appropriate public repositories
  • More than 38,000 coronavirus-related articles deposited with licenses that allow re-use and secondary analysis
  • More than 3,000 COVID-19 specific articles deposited
  • COVID-19 Open Research Dataset Challenge
  • COVID-19 Open Research Dataset (CORD-19)
     

Zenodo: COVID-19 Response

  • Zenodo is multi-disciplinary and supports good sustainable data management practice
  • Prioritization of COVID-19 requests
  • Quota increases beyond default
  • 1-on-1 with Zenodo supporters
  • Scripts for automated uploading of larger datasets
  • Curation of COVID-19 records
  • Coronavirus Disease Research Community - COVID-19
     

COVID-19 Open Repo Data on GitHub

  • Began publishing COVID-19 Open Data about projects on GitHub
  • Aid in the discoverability of important solutions being developed by our user community
  • Reduce the collaboration barrier while minimizing duplicate effort in a time of crisis
  • 50K+ repos with engagement by 1M+ distinct users, and viewed 40M+ times
  • See all COVID-19 repo data in GitHub
     

Figshare: Sharing, Discovering, and Citing COVID-19 Data and Code

  • Figshare is an online open access repository where researchers can preserve and share their research outputs, including figures, datasets, images, and videos.
  • Open science and the timely sharing of research data have played a critical role in advancing our understanding of COVID-19 and accelerating the pace of discovery.
  • NIH figshare instance
  • COVID-19 data in figshare
     

Dryad: Responding to & Aligning with an Evolving Research Landscape

COVID-19 and Mendeley Data

List of open-access data and computational resources that address COVID-19. To stay current on data science activities at NIH, be sure to bookmark the NIH website and follow them on Twitter with #NIHdata.