Toward a Coronavirus Knowledge Graph.

Journal: Genes
PMID:

Abstract

This study builds a coronavirus knowledge graph (KG) by merging two information sources. The first source is Analytical Graph (AG), which integrates more than 20 different public datasets related to drug discovery. The second source is CORD-19, a collection of published scientific articles related to COVID-19. We combined both chemo genomic entities in AG with entities extracted from CORD-19 to expand knowledge in the COVID-19 domain. Before populating KG with those entities, we perform entity disambiguation on CORD-19 collections using Wikidata. Our newly built KG contains at least 21,700 genes, 2500 diseases, 94,000 phenotypes, and other biological entities (e.g., compound, species, and cell lines). We define 27 relationship types and use them to label each edge in our KG. This research presents two cases to evaluate the KG's usability: analyzing a subgraph (ego-centered network) from the angiotensin-converting enzyme (ACE) and revealing paths between biological entities (hydroxychloroquine and IL-6 receptor; chloroquine and STAT1). The ego-centered network captured information related to COVID-19. We also found significant COVID-19-related information in top-ranked paths with a depth of three based on our path evaluation.

Authors

  • Peng Zhang
    Key Laboratory of Macromolecular Science of Shaanxi Province, School of Chemistry & Chemical Engineering, Shaanxi Normal University, Xi'an, Shaanxi 710062, China.
  • Yi Bu
    Department of Information Management, Peking University, Beijing, China.
  • Peng Jiang
    Department of Joint Surgery, Shandong Provincial Hospital Affiliated to Shandong University, Jinan, Shandong 250021, China.
  • Xiaowen Shi
    Division of Biological Sciences, University of Missouri, Columbia, MO, 65211, USA.
  • Bing Lun
    Beijing Knowledge Atlas Technology Co., Ltd., Beijing 100043, China.
  • Chongyan Chen
    School of Information, University of Texas at Austin, Austin, TX, USA.
  • Arida Ferti Syafiandini
    Department of Library and Information Science, Yonsei University, Seoul 03722, Korea.
  • Ying Ding
    Cockrell School of Engineering, The University of Texas at Austin, Austin, USA.
  • Min Song
    Library and Information Science, Yonsei University, Seoul, South Korea.