Workshop on Research Objects 2019 (RO2019)

Workshop at IEEE eScience 2019 — http://www.researchobject.org/ro2019/

Location: San Diego, CA, US
Venue: IEEE eScience 2019 (fees applied)
When: Tuesday 24 September 2019 (2019-09-24)
Duration: 1 day
Previous workshop: RO2018
Hashtags: #eScience2019 #ResearchObjects (plural!)

Resources
Workshop Organisers
Description
Submissions
Timeline
Keynote speaker
Short bio of the organizers
Program Committee

News

RO2019 workshop was held at eScience 2019, San Diego
Workshop Programme announced
Keynote Speaker announced
Proceedings published
Open Peer Reviews published
Twitter hashtags: #ResearchObjects #eScience2019

Resources

Workshop Organisers

Carole Goble (The University of Manchester, UK)
Raul Palma (Poznan Supercomputing and Networking Center, Poland)
Stian Soiland-Reyes (The University of Manchester, UK; Apache Software Foundation)
Daniel Garijo (University of Southern California, US)

Description

Scholarly Communication has evolved significantly in recent years, with an increasing focus on Open Research, FAIR data sharing and community-developed open source methods. The concepts of authorship and citation are changing, as researchers are increasingly reusing and evolving common software tools and datasets. Yet with a growing amount of cloud compute power and open platforms available, reproducibility of computational analyses becomes more challenging, and not yet commonly included in peer review. While recent advances in scientific workflows and provenance capture systems have improved on this situation, a question remains on how to publish, archive, explore and understand digital research outputs, as academic authors and publishers remain focused on PDFs and the occasional CSV file, with the Web and Open Research often left to “best effort” rather than being the expected norm.

A number of community initiatives have begun to explore how to package various multi-part research outcomes with their context, how to handle distributed and living content and how to port and safely exchange these “Research Objects” between platform and between researchers.

One such approach is Research Object which has proposed a way to package and describe research outputs, data, methods, workflows, provenance and structured metadata, reusing existing Web standards and formats. Research Objects, and Research Object-like approaches have gathered pace across:

Scientific domains - bioinformatics, systems biology (e.g. COMBINE Archives), health informatics (e.g. BioComputeObject);
Tasks - handling big data (BDBags), reproducible workflows (Common Workflow Language), scholarly communication and publishing (Dryad, dataONE);
Platforms:
- Virtual Research Environments, e.g. the EVER-EST VRE for Earth Sciences;
- Community output aggregators such as the EU’s OpenAIRE, FAIRDOMHub, CodeOcean and Open Science Framework.
Stakeholders
- Publishers - such as eLife’s Reproducible Document Stack project and science.ai,
- Repository providers such a dataONE and Dryad and Mendeley Data;
- Funders, such as the EU (European Open Science Cloud) and NIH (e.g. Data Commons)

However, many challenges remain as to how to increase Research Object uptake with data providers, researchers, infrastructures, publishers and other stakeholders; credit and tracking metrics; develop supporting tooling; building effective community efforts and the relationship of rich metadata manifests with emerging container platforms.

In this workshop, following on from the successful RO2018 held at eScience 2018, we explored recent advancements in Research Objects and publishing of research data.

This workshop aimed for a mix of presentation sessions and unconferencing sessions including:

Keynotes and invited talks
Presentations selected from short article submissions and extended abstracts
Demonstrations and (informal) poster session
2-minute Lightning talks for ad-hoc presentations and hot topics
Break-out sessions to further build relationships across scientific domains and RO practitioners.

To suggest lightning talks, demos, discussion topics and break-out sessions, workshop participants could add to the RO2019 collective notes or add to GitHub.

Submissions

For RO2019 we welcomed submissions for Short articles (about 4-8 pages), Abstracts for oral communication (1-2 pages) and Poster and demo abstracts (1-2 pages).

RO2019 practiced open peer review, and recommended that reviewers are named and attributed; however reviewers could be anonymous if so desired.

Please see our submission guide for details. Authors could then either:

Timeline

~~2019-07-05 RO2019 submissions due: articles & oral presentation~~
~~2019-07-15 RO2019 submissions due: abstracts for oral presentation~~
~~2019-07-27 RO2019 Notification of acceptance~~
~~2019-07-30 Preliminary RO2019 Programme~~
~~2019-08-30 Camera-ready papers due for accepted articles in (IEEE eScience proceedings)~~
~~2019-09-02 RO2019 submissions due: poster/demo abstracts~~
~~2019-09-09 RO2019 Programme finalized~~
2019-09-24 RO2019 at IEEE eScience 2019

All accepted submitters were requested to present at the workshop and register for IEEE eScience 2019, where fees apply.

Keynote speaker

Bertram Ludäscher: The keynote speaker was Bertram Ludäscher, who presented: From Research Objects to Reproducible Science Tales.

Bertram Ludäscher is a professor at the School of Information Sciences at the University of Illinois, Urbana-Champaign, where he directs the Center for Informatics Research in Science and Scholarship (CIRSS). He is also a faculty affiliate with the National Center for Supercomputing Applications (NCSA) and the Department of Computer Science at Illinois. Until 2014 he was a professor at the Department of Computer Science at the University of California, Davis.

His research interests range from practical questions in scientific data and workflow management, to database theory and knowledge representation and reasoning.

Prior to his faculty appointments, he was a research scientist at the San Diego Supercomputer Center (SDSC) and an adjunct faculty at the CSE Department at UC San Diego. He received his M.S. (Dipl.-Inform.) in Computer Science from the University of Karlsruhe (now K.I.T.), and his PhD (Dr. rer. nat.) from the University of Freiburg, in Germany.

Short bio of the organizers

Carole Goble (The University of Manchester, UK)

Carole Goble heads researchobject.org along with her team in the eScience Lab at the University of Manchester, whose mission is to disseminate knowledge about research objects, their concept and their adoption. She has spent 25 years working in e-Science on computational workflows, reproducible science, open sharing, and knowledge and metadata management in a range of disciplines.

She co-established the myExperiment.org workflow repository and the FAIRDOMHub.org for systems biology asset sharing. She is the co-lead of the interoperability platform for ELIXIR, the EU Research Infrastructure for Life Sciences, Head of Node of ELIXIR-UK and co-founder of the UK’s Software Sustainability Institute. She has keynoted twice for IEEE e-Science (2005, 2012). In 2008 she was honoured by Microsoft Jim Gray award for outstanding contributions to eScience.

https://orcid.org/0000-0003-1219-2137

Raul Palma (Poznan Supercomputing and Networking Center, Poland)

Raul Palma is the semantic technologies coordinator in the network services division at Poznan Supercomputing and Networking Center (PSNC). He has more than twelve years of experience in research and development in different areas related to Artificial Intelligence, such as knowledge representation, discovery and reasoning, Open and Linked Data, and the application of Semantic technologies in different domains.

He has been involved in the development of the Research Object model and supporting technologies since their conception, and has been devoted to their dissemination, adoption and exploitation in different domains. He has led the development team of the research object management platform ROHub (http://www.rohub.org/), built entirely around the research object model and inspired by sustainable software management principles.

Currently, Raul is leading PSNC activities in EVER-EST project, where Research Objects and ROHub have been used as the cornerstone to build a Virtual Research Environment for Earth Sciences. An ACM member and former MAE-AECI scholarship holder, Raul holds a Ph.D. in Computer Science and Artificial Intelligence (cum laude) and regularly publishes and reviews for top scientific conferences and journals in the area.

https://orcid.org/0000-0003-4289-4922

Stian Soiland-Reyes (The University of Manchester, UK; University of Amsterdam, NL; Apache Software Foundation)

Stian Soiland-Reyes is a senior Research Software Engineer, working at the eScience Lab in the University of Manchester since 2006. His research and development interests are in reproducible Open Science by applying semantic technologies and distributed computing. He is a persistent advocate of Open Access, data sharing and improving practices of academic publishing. As a keen open source developer and Foundation Member of Apache Software Foundation, he has a key role in development of the workflow system Apache Taverna (incubating) and Common Workflow Language (he is on the CWL leadership team), as well as contributing to Linked Data initiatives such as Commons RDF, Jena and JSON-LD.

Stian was a key participant in the Wf4Ever project, where he co-led specifications for preserving and publishing workflow-based Research Objects. He is a co-author of the W3C PROV-O standard for provenance as well as the PAV provenance ontology, he has contributed to the W3C Web Annotation Data Model, ORCID and OAI-ORE. Stian currently work with the BioExcel Centre of Excellence with attention to interoperable workflows in HPC and HTC environments for biomolecular simulation and modelling.

https://orcid.org/0000-0001-9842-9718

Daniel Garijo (University of Southern California, US)

Daniel is a researcher at the Information Sciences Institute of the University of Southern California and along term collaborator with the Ontology Engineering Group at the Artificial Intelligence Department of the Computer Science Faculty of Universidad Politécnica de Madrid.

His research activities focus on e-Science and the Semantic Web, specifically on how to increase the understandability of scientific workflows using their provenance (i.e., steps, outputs, inputs, intermediate results) and exposing them as Linked Data.

Daniel has participated in standardization efforts and collaborated with different researchers throughout his career, which have led to significant contributions to my PhD work. He has been involved in the development of the Research Object model and its ontologies since the concepts inception in the EU project Wf4ever.

https://orcid.org/0000-0003-0454-7145

Program Committee

Eoghan Ó Carragáin (University College Cork, Ireland)
Frederik Coppens (VIB, Belgium)
Oscar Corcho (Universidad Politécnica de Madrid, Spain)
David De Roure (University of Oxford and Alan Turing Institute, UK)
José Manuel Gómez Pérez (expertsystem.com, Spain)
Farah Zaib Khan (The University of Melbourne, Australia)
David Koop (University of Massachusetts Dartmouth, USA)
Dimitris Koureas (Naturalis, NL)
Paolo Manghi (Open AIRE and CNR, Pisa Italy)
Raja Mazumder (BioComputeObjects, George Washington University, USA)
James Myers (SEAD project, University of Michigan, USA)
Denis Parfenov (open knowledge, Ireland)
Peter Sefton (University of Technology Sydney, Australia)
Anita de Waard (Elsevier)