Automatic extraction of protein-protein interactions using grammatical relationship graph.

Journal: BMC Medical Informatics and Decision Making
Published Date:

Abstract

BACKGROUND: Relationships between bio-entities (genes, proteins, diseases, etc.) constitute a significant part of our knowledge. Most of this information is documented as unstructured text in different forms, such as books, articles, and online pages. Automatically extracting this information and storing it in structured form could help researchers access it more easily and make it possible to incorporate it into advanced integrative analyses. In this study, we developed a novel approach to extract bio-entity relationship information using Natural Language Processing (NLP) and a graph-theoretic algorithm.

Authors

  • Kaixian Yu
    Insilicom LLC, Tallahassee, FL, USA.
  • Pei-Yau Lung
    Department of Statistics, Florida State University, Tallahassee, FL, 32306, USA.
  • Tingting Zhao
    School of Software Engineering, Beihang University, Beijing, China.
  • Peixiang Zhao
    Department of Computer Science, Florida State University, Tallahassee, FL, 32306, USA.
  • Yan-Yuan Tseng
    Center for Molecular Medicine and Genetics, School of Medicine, Wayne State University, Detroit, MI, 48201, USA.
  • Jinfeng Zhang
    Department of Statistics, Florida State University, Tallahassee, FL, 32306, USA. jinfeng@stat.fsu.edu.