Dataset2RDF
The SPHN Dataset2RDF is a Python tool developed by the DCC.
The tool can be use to translate semantics defined in the SPHN Dataset into an RDF Schema.
Since 2023, the Dataset2RDF is used by the DCC to generate the SPHN RDF Schema (i.e. the 2023.2 Release onwards).
Introduction
The Dataset2RDF translates the concepts and composedOfs defined in the SPHN Dataset into a formal representation using RDF, RDFS, and OWL. The output of the Dataset2RDF tool is an SPHN-compliant RDF Schema.
Usage
The Dataset2RDF is built to support two types of scenarios:
SPHN Dataset: In this scenario, the input is the SPHN Dataset XLSX and the output is the SPHN RDF Schema as a TTL file.
Project-specific Dataset: In this scenario, the input is a project-specific Dataset XLSX, adapted from the Dataset Template
SPHN Dataset
The Dataset2RDF tool can parse the SPHN Dataset XLSX as follows:
dataset2rdf --input SPHN_Dataset.xlsx \
--output sphn_rdf_schema.ttl \
--config dataset2rdf/config.yaml
The tool takes as input the SPHN Dataset XLSX and a config.yaml
, and generates a RDF Schema TTL.
Project-specific Dataset
The Dataset2RDF tool can parse also parse a project-specific Dataset XLSX as follows:
dataset2rdf --input Project_Dataset.xlsx \
--output sphn_rdf_schema.ttl \
--project-output project_specific_rdf_schema.ttl \
--config dataset2rdf/config.yaml
The tool takes as input the a project-specific Dataset XLSX and a config.yaml
, and generates project-specific RDF Schema TTL.
Note
To generate the project-specific Dataset you should make use of the SPHN Dataset Template and adapt it for your project needs. For more information refer to Generate a project-specific RDF Schema.
Availability and usage rights
© Copyright 2023, Personalized Health Informatics Group (PHI), SIB Swiss Institute of Bioinformatics
The Dataset2RDF is available at https://git.dcc.sib.swiss/sphn-semantic-framework/dataset2rdf licensed under the GPLv3 license.
For any question or comment, please contact the SPHN Data Coordination Center (DCC) at dcc@sib.swiss.