OpenPVSignal

An ontology facilitating the publishing of pharmacovigilance signal information

View the Project on GitHub inab-certh/OpenPVSignal

Advancing Information Search, Sharing and Reuse on Pharmacovigilance Signals via FAIR Principles and Semantic Web Technologies

OpenPVSignal is an ontology aiming to foster the publication of Pharmacovigilance (PV) Signal information, which is currently communicated by drug regulatory authorities via their newsletters or web sites as free-text reports.

OpenPVSignal relies on Linked Data and Semantic Web technologies, and conforms to FAIR data principles, aspiring to advance search, sharing and reuse of information about PV Signals.

A scientific paper describing the OpenPVSignal design, development and validation has been published in “Frontiers in Pharmacology” and can be found here.

Moreover, an article about OpenPVSignal was published in Uppsala Reports, the news magazine concerning the latest issues in medicines safety released by Uppsala Monitoring Centre, and it can be accessed here.

Based on the OpenPVSignal ontology, a Knowledge Graph (KG) has been built, including 101 PV Signal reports that have been published between 2012 and 2019 by the Uppsala Monitoring Center. These datasets have been manually converted to individual KGs and have been through an iterative quality control process.

In this repository you may find:

  1. The OpenPVSignal ontology model.
  2. The OpenPVSignal web page.
  3. Indicative instantiations of the OpenPVSignal ontology contained in the examples folder, concerning 3 signal reports communicated via:
  4. The KG in turtle format.
  5. The SHACL data models used to validate the KG dataset.
  6. An R based script that executes the KG alidation pipeline as a whole.

Regarding the building of the OpenPVSignal KG, one of the final stages was the validation of data schema that was performed using the SHACL Shapes Constraint Language, a language for validating RDF graphs against a set of conditions[1].

The pySHACL python library is used to run the SHACL tests and an R script is used to create the summarized reports

pip install pyshacl

after installing R, navigate to the folder where OpenPVSignal repository has been cloned

Rscript run_com.r

The outcome of this script is a bulk of output files. More specfically, a folder is created for each PV signal containing the outcome of all the test cases (e.g. the folder “X2012_m_donepezil.SSRI.SNRI.interaction.Serotoni.syndrome.2nd” contains the results of the specific signal). Moreover, a validation report file is produced for each PV signal in a folder named “shaclResults” (only if warnings/errors are detected)[1].

OpenPVSignal is versioned following the semantic versioning scheme, applying the guidelines provided here. The last part of the version refers to the date that this version has been created. For example, version 0.8.20171211 is the version that has been first commited on GitHub, on 2017, 11th of December. Typically, each new version should be worked on a branch, following the rationale presented here.

[1] Note that some of the constrained imposed by the SHACL tests still appear on the results as a violation. This can be attributed to the original data sources, i.e. for a patient appearing on an Individual Case Safety Report (ICSR) their gender and age should have been recorded but this is not the case for each and every ICSR. The SHACL tests that have been written, will mark that as a violation and it should be manually verified that the field is indeed missing on the original data.

For further information or guidance on how you can use OpenPVSignal or contribute in its further development, please contact Pantelis NATSIAVAS (M) (pnatsiavas@certh.gr) or Achilleas CHYTAS (M) (achytas@certh.gr).