Gene ontology concept recognition using named concept: understanding the various presentations of the gene functions in biomedical literature.

Journal: Database : the journal of biological databases and curation
Published Date:

Abstract

OBJECTIVE: A major challenge in precision medicine is the development of patient-specific genetic biomarkers or drug targets. The firsthand information of the genes associated with the pathologic pathways of interest is buried in the ocean of biomedical literature. Gene ontology concept recognition (GOCR) is a biomedical natural language processing task used to extract and normalize the mentions of gene ontology (GO), the controlled vocabulary for gene functions across many species, from biomedical text. The previous GOCR systems, using either rule-based or machine-learning methods, treated GO concepts as separate terms and did not have an efficient way of sharing the common synonyms among the concepts.

Authors

  • Chia-Jung Yang
    Department of Computer Science and Information Engineering, National Cheng Kung University, 1, University Road, Tainan City, Taiwan.
  • Jung-Hsien Chiang
    Department of Computer Science and Information Engineering, National Cheng Kung University, 1, University Road, Tainan City, Taiwan.