Archival Data Repositories

November 19, 2018 at 4-5:30pm in BIDS, 190 Doe Library

Welcome!

Speakers

Joshua Quan - Data Librarian @ UC Berkeley Library, D-Lab

Content

This DATS session will introduce archival data repositories researchers might be interested in using to discover datasets or depositing their own data and code for long-term archiving for others to discover. We will cover Dataverse, Dash/Dryad, and Zenodo.

Objectives:

Data Repository Defined

From Registry of Research Data Repositories: “subtype of a sustainable information infrastructure which provides long-term storage and access to research data that is the basis for a scholarly publication. Research data means information objects generated by scholarly projects for example through experiments, measurements, surveys or interviews.”

…So it’s a place to put your data and analysis scripts that will be accessible beyond the life of a research project, grant, or individual career.

A minimum rationale for depositing/sharing…

  1. Sharing your data gives you credit for your work that everyone can see

  2. Your hard work will persist and be discoverable

… fulfills the most basic components of F.A.I.R principles for scientific data

Things to Consider when choosing a Repository

Reputation

Sustainability

Visibility

Usability

Features

Formats

Rights

General vs. Subject Specific Repositories

General repositories

-some cool ideas floating around

Subject repositories

APIs + Wrappers

Zenodo(R)
PyZenodo(Python)

Dataverse(R)
Dataverse-client(Python)

Github Search API

Dataverse Walk-through

On your own

Using the Comparative Overview of Features document as a template, think about your own research and the kind of repository (general vs. specific) that makes the most sense for your archival needs.

Contacts

https://researchdata.berkeley.edu/

http://dlab.berkeley.edu/

https://www.cdlib.org/services/uc3/dash.html

Share