AIMC Topic: Data Curation

Clear Filters Showing 81 to 90 of 142 articles

HPIDB 2.0: a curated database for host-pathogen interactions.

Database : the journal of biological databases and curation
Identification and analysis of host-pathogen interactions (HPI) is essential to study infectious diseases. However, HPI data are sparse in existing molecular interaction databases, especially for agricultural host-pathogen systems. Therefore, resourc...

Bi-convex Optimization to Learn Classifiers from Multiple Biomedical Annotations.

IEEE/ACM transactions on computational biology and bioinformatics
The problem of constructing classifiers from multiple annotators who provide inconsistent training labels is important and occurs in many application domains. Many existing methods focus on the understanding and learning of the crowd behaviors. Sever...

A study of the effectiveness of machine learning methods for classification of clinical interview fragments into a large number of categories.

Journal of biomedical informatics
This study examines the effectiveness of state-of-the-art supervised machine learning methods in conjunction with different feature types for the task of automatic annotation of fragments of clinical text based on codebooks with a large number of cat...

BELTracker: evidence sentence retrieval for BEL statements.

Database : the journal of biological databases and curation
Biological expression language (BEL) is one of the main formal representation models of biological networks. The primary source of information for curating biological networks in BEL representation has been literature. It remains a challenge to ident...

OntoStudyEdit: a new approach for ontology-based representation and management of metadata in clinical and epidemiological research.

Journal of biomedical semantics
BACKGROUND: The specification of metadata in clinical and epidemiological study projects absorbs significant expense. The validity and quality of the collected data depend heavily on the precise and semantical correct representation of their metadata...

A Study of Neural Word Embeddings for Named Entity Recognition in Clinical Text.

AMIA ... Annual Symposium proceedings. AMIA Symposium
Clinical Named Entity Recognition (NER) is a critical task for extracting important patient information from clinical text to support clinical and translational research. This study explored the neural word embeddings derived from a large unlabeled c...

Finding Cervical Cancer Symptoms in Swedish Clinical Text using a Machine Learning Approach and NegEx.

AMIA ... Annual Symposium proceedings. AMIA Symposium
Detection of early symptoms in cervical cancer is crucial for early treatment and survival. To find symptoms of cervical cancer in clinical text, Named Entity Recognition is needed. In this paper the Clinical Entity Finder, a machine-learning tool tr...

Scaling Out and Evaluation of OBSecAn, an Automated Section Annotator for Semi-Structured Clinical Documents, on a Large VA Clinical Corpus.

AMIA ... Annual Symposium proceedings. AMIA Symposium
"Identifying and labeling" (annotating) sections improves the effectiveness of extracting information stored in the free text of clinical documents. OBSecAn, an automated ontology-based section annotator, was developed to identify and label sections ...

SORTA: a system for ontology-based re-coding and technical annotation of biomedical phenotype data.

Database : the journal of biological databases and curation
There is an urgent need to standardize the semantics of biomedical data values, such as phenotypes, to enable comparative and integrative analyses. However, it is unlikely that all studies will use the same data collection protocols. As a result, ret...

Using distant supervised learning to identify protein subcellular localizations from full-text scientific articles.

Journal of biomedical informatics
Databases of curated biomedical knowledge, such as the protein-locations reflected in the UniProtKB database, provide an accurate and useful resource to researchers and decision makers. Our goal is to augment the manual efforts currently used to cura...