RO-Crate 1.0 specification released

By Stian Soiland-Reyes

The researchobject.org community recommendation RO-Crate 1.0 has been released.

RO-Crate (Research Object Crate) specifies a method of organizing file-based data with associated metadata, using Linked Data principles, in both human and machine readable formats, with the ability to include additional domain-specific metadata.

The core of RO-Crate is a JSON-LD file, the RO-Crate Metadata File, named ro-crate-metadata.jsonld. This file contains structured metadata about the dataset as a whole (the Root Data Entity) and, about some or all of its files within a data package. This provides a simple way to, for example, assert the authors (e.g. people, organizations) of the RO-Crate or one its files, or to capture more detailed provenance and metadata for files, such as how they were created using contextual entities such as software, equipment, geographical places, funders and subjects.

The RO-Crate specification is based on schema.org, providing an opinionated specification of its use to describe Research Objects, with practical use guides and examples for software authors to create tools for generating and consuming research data packages.

Building community consensus

RO-Crate is a fresh initiative, bringing together data archive and repository maintainers with existing Research Object, workflow and provenance communities. Starting as a small cross-domain group, organically formed to build the core principles and first sketches of their use, we are now expanding to collect use cases and reaching out to other packaging initiatives to build common ground.

For the eScience LAb, one emerging use of RO-Crate is for capturing workflows and tools in a federated workflow repository being built in EOSC-Life, a large European Open Science Cloud project across 13 research infrastructures in the life science domain. However RO-Crate is also aiming to be usable by individual Data Scientists and Digital Library efforts with no particular infrastructure beyond Jupyter notebook, for developers who may not have the time or motivation to use a cascade of metadata vocabularies and research data management tools.

RO-Crate development and discussion is done openly in a GitHub repository by volunteers, with monthly telcons to synchronize the effort. Anyone can join to help form the RO-Crate approach.

Cite RO-Crate

Eoghan Ó Carragáin; Carole Goble; Peter Sefton; Stian Soiland-Reyes (2019):
A lightweight approach to research object data packaging. Bioinformatics Open Source Conference (BOSC2019) https://doi.org/10.5281/zenodo.3250687

Peter Sefton, Eoghan Ó Carragáin, Stian Soiland-Reyes, Oscar Corcho, Daniel Garijo, Raul Palma, Frederik Coppens, Carole Goble, José María Fernández, Kyle Chard, Jose Manuel Gomez-Perez, Michael R Crusoe, Ignacio Eguinoa, Nick Juty, Kristi Holmes, Jason A. Clark, Salvador Capella-Gutierrez, Alasdair J. G. Gray, Stuart Owen, Alan R Williams, Giacomo Tartari, Finn Bacall, Thomas Thelen (2019):
RO-Crate Metadata Specification 1.0. researchobject.org Community Recommendation. https://doi.org/10.5281/zenodo.3541888 https://w3id.org/ro/crate/1.0

This news item has been adapted from the an abstract at *Workshop on Research Objects 2019* at *eScience 2019*, see https://doi.org/10.5281/zenodo.3337883 for details.

Example RO-Crate Metadata File

{"@context": "https://w3id.org/ro/crate/1.0/context",
 "@graph": [
  { "@id": "ro-crate-metadata.jsonld",
    "@type": "CreativeWork",
    "conformsTo": {"@id": "https://w3id.org/ro/crate/1.0"},
    "about": {"@id": "./"}
  },
  {
    "@id": "./",
    "@type": "Dataset",
    "identifier": "https://doi.org/10.5281/zenodo.3541888",
    "name": "RO-Crate specification dataset",
    "version": "1.0.0",
    "license": { "@id": "https://www.apache.org/licenses/LICENSE-2.0"},
    "datePublished": "2019-11-15",
    "hasPart": [
        {"@id": "ro-crate-1.0.0.html"},
        {"@id": "ro-crate-context-1.0.0.jsonld"}
    ]
  },
  { "@id": "ro-crate-1.0.0.html",
    "@type": ["CreativeWork","File"],
    "encodingFormat": "text/html",
    "name": "RO-Crate Metadata Specification 1.0",
    "identifier":  {"@id": "https://w3id.org/ro/crate/1.0"},
    "version": "1.0"
  },
  {
    "@id": "ro-crate-context-1.0.0.jsonld",
    "@type": "File",
    "encodingFormat": "application/ld+json",
    "name": "RO-Crate JSON-LD Context",
    "identifier": {"@id": "https://w3id.org/ro/crate/1.0/context"},
    "license": {"@id": "https://creativecommons.org/publicdomain/zero/1.0/"},
    "author": [
        { "@id": "https://orcid.org/0000-0002-3545-944X"},
        { "@id": "https://orcid.org/0000-0001-9842-9718"}
    ]
  },
  { "@id": "https://orcid.org/0000-0002-3545-944X",
    "@type": "Person",
    "name": "Peter Sefton"
  },
  { "@id": "https://orcid.org/0000-0001-9842-9718",
    "@type": "Person",
    "name": "Stian Soiland-Reyes"
  }
 ]
}