Skip to content Skip to footer

COMPSs

COMPSs logo

COMP Superscalar (COMPSs) is a task-based programming model which aims to ease the development of applications for distributed infrastructures, such as large High-Performance Clusters (HPC), Clouds and Container managed clusters. PyCOMPSs is the Python binding of COMPSs.

COMPSs provides a programming interface for the development of applications in Python/Java/C/C++, a runtime system that exploits the inherent parallelism of applications at execution time, and a rich ecosystem for the operation, monitoring, performance evaluation and integration with Jupyter/Jupyterlab.

The COMPSs runtime includes the capacity of automatically recording details of the application’s execution as metadata, also known as Workflow Provenance. The metadata is recorded in RO-Crate format, following Workflow RO-Crate and Workflow Run Crate profiles. With workflow provenance, you are able to share not only your workflow application (i.e. the source code) but also your workflow run (i.e. the datasets used as inputs, and the outputs generated as results).

Provenance information can be useful for a number of things, including Governance, Reproducibility, Replicability, Traceability, or Knowledge Extraction, among others. In our case, we have initially targeted workflow provenance recording to enable users to publish research results obtained with COMPSs as artifacts that can be cited in scientific publications with their corresponding DOI, by using WorkflowHub. Both workflow provenance metadata and its publication in WorkflowHub enable the reproducibility of the workflows.

COMPSs with RO-Crate

Examples of COMPSs RO-Crates

Plenty of examples of COMPSs Workflows with enabled provenance recording can be found at WorkflowHub (filtering the browsing by ‘COMPSs’ workflow type).

In addition, the COMPSs User Manual has a dedicated section on how to generate Workflow Provenance with COMPSs.

Resources

Publications

Raül Sirvent, Javier Conejero, Francesc Lordan, Jorge Ejarque, Laura Rodríguez-Navas, José M Fernández, Salvador Capella-Gutiérrez, Rosa M Badia (2022):
Automatic, Efficient and Scalable Provenance Registration for FAIR HPC Workflows.
IEEE/ACM Workshop on Workflows in Support of Large-Scale Science (WORKS) (1-9) https://doi.org/10.1109/WORKS56498.2022.00006
[preprint]

Leo, S., Crusoe, M. R., Rodríguez-Navas, L., Sirvent, R., Kanitz, A., De Geest, P., … & Soiland-Reyes, S. (2024): Recording provenance of workflow runs with RO-Crate. PLoS ONE 19(9): e0309210. https://doi.org/10.1371/journal.pone.0309210