AIMC Topic: Data Curation

Clear Filters Showing 81 to 90 of 147 articles

Structuring Legacy Pathology Reports by openEHR Archetypes to Enable Semantic Querying.

Methods of information in medicine
BACKGROUND: Clinical information is often stored as free text, e.g. in discharge summaries or pathology reports. These documents are semi-structured using section headers, numbered lists, items and classification strings. However, it is still challen...

Automatic query generation using word embeddings for retrieving passages describing experimental methods.

Database : the journal of biological databases and curation
Information regarding the physical interactions among proteins is crucial, since protein-protein interactions (PPIs) are central for many biological processes. The experimental techniques used to verify PPIs are vital for characterizing and assessing...

OntoBrowser: a collaborative tool for curation of ontologies by subject matter experts.

Bioinformatics (Oxford, England)
UNLABELLED: The lack of controlled terminology and ontology usage leads to incomplete search results and poor interoperability between databases. One of the major underlying challenges of data integration is curating data to adhere to controlled term...

Training and evaluation corpora for the extraction of causal relationships encoded in biological expression language (BEL).

Database : the journal of biological databases and curation
Success in extracting biological relationships is mainly dependent on the complexity of the task as well as the availability of high-quality training data. Here, we describe the new corpora in the systems biology modeling language BEL for training an...

Crowdsourcing and curation: perspectives from biology and natural language processing.

Database : the journal of biological databases and curation
Crowdsourcing is increasingly utilized for performing tasks in both natural language processing and biocuration. Although there have been many applications of crowdsourcing in these fields, there have been fewer high-level discussions of the methodol...

HPIDB 2.0: a curated database for host-pathogen interactions.

Database : the journal of biological databases and curation
Identification and analysis of host-pathogen interactions (HPI) is essential to study infectious diseases. However, HPI data are sparse in existing molecular interaction databases, especially for agricultural host-pathogen systems. Therefore, resourc...

Bi-convex Optimization to Learn Classifiers from Multiple Biomedical Annotations.

IEEE/ACM transactions on computational biology and bioinformatics
The problem of constructing classifiers from multiple annotators who provide inconsistent training labels is important and occurs in many application domains. Many existing methods focus on the understanding and learning of the crowd behaviors. Sever...

A study of the effectiveness of machine learning methods for classification of clinical interview fragments into a large number of categories.

Journal of biomedical informatics
This study examines the effectiveness of state-of-the-art supervised machine learning methods in conjunction with different feature types for the task of automatic annotation of fragments of clinical text based on codebooks with a large number of cat...

BELTracker: evidence sentence retrieval for BEL statements.

Database : the journal of biological databases and curation
Biological expression language (BEL) is one of the main formal representation models of biological networks. The primary source of information for curating biological networks in BEL representation has been literature. It remains a challenge to ident...

OntoStudyEdit: a new approach for ontology-based representation and management of metadata in clinical and epidemiological research.

Journal of biomedical semantics
BACKGROUND: The specification of metadata in clinical and epidemiological study projects absorbs significant expense. The validity and quality of the collected data depend heavily on the precise and semantical correct representation of their metadata...