Assessing the Need of Discourse-Level Analysis in Identifying Evidence of Drug-Disease Relations in Scientific Literature.

Journal: Studies in health technology and informatics

Published Date: Jan 1, 2015

Abstract

Relation extraction typically involves extracting relations between two or more entities that occur within a single sentence or across multiple sentences. In this study, we investigated the significance of extracting information from multiple sentences, specifically in the context of drug-disease relation discovery. We used multiple resources, such as Semantic Medline, a literature-based resource, and Medline search (for filtering spurious results), and inferred 8,772 potential drug-disease pairs. Our analysis revealed that 6,450 (73.5%) of the 8,772 potential drug-disease relations did not occur in a single sentence. Moreover, only 537 of the drug-disease pairs matched the curated gold standard in the Comparative Toxicogenomics Database (CTD), a trusted resource for drug-disease relations. Among the 537, about 76% (407) of the drug-disease pairs occur in multiple sentences. Thus, most of the drug-disease pairs, whether inferred from Semantic Medline or retrieved from CTD, could be extracted only from multiple sentences in the literature. This highlights the need for discourse-level analysis in extracting relations from biomedical literature.
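The distinction between single-sentence and multi-sentence evidence can be illustrated with a minimal sketch. It is not the authors' pipeline: the naive regex sentence splitter, the plain substring matching, the function names, and the placeholder entities "DrugX" and "DiseaseY" are all assumptions made for illustration. The `share` helper only recomputes percentages from the counts reported in the abstract.

```python
import re


def split_sentences(text):
    """Naive splitter (assumption): break after '.', '!' or '?' plus whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]


def cooccurrence_level(text, drug, disease):
    """Classify a drug-disease pair as 'single-sentence', 'multi-sentence' or 'absent'.

    Matching is case-insensitive substring search, a stand-in for real
    entity recognition and normalization.
    """
    sentences = [s.lower() for s in split_sentences(text)]
    drug, disease = drug.lower(), disease.lower()
    has_drug = [drug in s for s in sentences]
    has_disease = [disease in s for s in sentences]
    # Both entities appear together in at least one sentence.
    if any(d and z for d, z in zip(has_drug, has_disease)):
        return "single-sentence"
    # Both entities appear in the text, but never in the same sentence.
    if any(has_drug) and any(has_disease):
        return "multi-sentence"
    return "absent"


def share(part, total):
    """Percentage of `part` in `total`, rounded to one decimal place."""
    return round(100.0 * part / total, 1)


# Made-up example text with hypothetical entities.
spread = "DrugX was given for 12 weeks. Symptoms of DiseaseY improved."
together = "DrugX improved DiseaseY."
print(cooccurrence_level(spread, "DrugX", "DiseaseY"))    # multi-sentence
print(cooccurrence_level(together, "DrugX", "DiseaseY"))  # single-sentence

# Counts taken from the abstract.
print(share(6450, 8772))  # 73.5
print(share(407, 537))    # 75.8
```

A sentence-level extractor would only see pairs of the second kind; pairs of the first kind require some form of cross-sentence (discourse-level) analysis.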

Authors

Majid Rastegar-Mojarad
Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA.

Ravikumar Komandur Elayavilli
Mayo Clinic, Rochester, MN, USA.

Dingcheng Li
These authors contributed equally to this study and Dr. Li is now working at IBM; Department of Health Sciences Research, Mayo Clinic, Rochester, MN, USA.

Hongfang Liu
Department of Artificial Intelligence & Informatics, Mayo Clinic, Rochester, MN, United States.

Keywords

Data Mining; Drug-Related Side Effects and Adverse Reactions; Humans; Machine Learning; MEDLINE; Natural Language Processing; Needs Assessment; Periodicals as Topic; Science; Semantics

External Resources

PubMed (PMID: 26262109)
