Using predicate and provenance information from a knowledge graph for drug efficacy screening.

Journal: Journal of biomedical semantics
Published Date:

Abstract

BACKGROUND: Biomedical knowledge graphs have become important tools to computationally analyse the comprehensive body of biomedical knowledge. They represent knowledge as subject-predicate-object triples, in which the predicate indicates the relationship between subject and object. A triple can also contain provenance information, which consists of references to the sources of the triple (e.g. scientific publications or database entries). Knowledge graphs have been used to classify drug-disease pairs for drug efficacy screening, but existing computational methods have often ignored predicate and provenance information. Using this information, we aimed to develop a supervised machine learning classifier and determine the added value of predicate and provenance information for drug efficacy screening. To ensure the biological plausibility of our method we performed our research on the protein level, where drugs are represented by their drug target proteins, and diseases by their disease proteins.

Authors

  • Wytze J Vlietstra
    Department of Medical Informatics, Erasmus University Medical Centre, Rotterdam, 3015, GE, the Netherlands. w.vlietstra@erasmusmc.nl.
  • Rein Vos
    Department of Medical Informatics, Erasmus University Medical Centre, Rotterdam, 3015, GE, the Netherlands.
  • Anneke M Sijbers
    Centre for Molecular and Biomolecular Informatics, Radboudumc, Nijmegen, 6525, GA, the Netherlands.
  • Erik M van Mulligen
    Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, The Netherlands.
  • Jan A Kors
    Department of Medical Informatics, Erasmus University Medical Center, Rotterdam, The Netherlands j.kors@erasmusmc.nl.