Self-training in significance space of support vectors for imbalanced biomedical event data.

Journal: BMC bioinformatics

Published Date: Apr 23, 2015

Abstract

BACKGROUND: Pairwise relationships extracted from biomedical literature are insufficient in formulating biomolecular interactions. Extraction of complex relations (namely, biomedical events) has become the main focus of the text-mining community. However, there are two critical issues that are seldom dealt with by existing systems. First, an annotated corpus for training a prediction model is highly imbalanced. Second, supervised models trained on only a single annotated corpus can limit system performance. Fortunately, there is a large pool of unlabeled data containing much of the domain background that one can exploit.

Authors

Tsendsuren Munkhdalai
Oyun-Erdene Namsrai
Keun Ryu

Keywords

Algorithms; Data Mining; Databases, Bibliographic; Humans; Information Storage and Retrieval; Models, Theoretical; Natural Language Processing; Periodicals as Topic; Terminology as Topic

External Resources

PubMed ID: 25952719
