Data Curation - AI Medical Compendium

A Study of Neural Word Embeddings for Named Entity Recognition in Clinical Text.

AMIA ... Annual Symposium proceedings. AMIA Symposium Nov 5, 2015

Clinical Named Entity Recognition (NER) is a critical task for extracting important patient information from clinical text to support clinical and translational research. This study explored the neural word embeddings derived from a large unlabeled c...

Pattern Recognition, Automated Terminology as Topic Natural Language Processing Semantics Data Curation Humans Algorithms

View on PubMed

Finding Cervical Cancer Symptoms in Swedish Clinical Text using a Machine Learning Approach and NegEx.

AMIA ... Annual Symposium proceedings. AMIA Symposium Nov 5, 2015

Detection of early symptoms in cervical cancer is crucial for early treatment and survival. To find symptoms of cervical cancer in clinical text, Named Entity Recognition is needed. In this paper the Clinical Entity Finder, a machine-learning tool tr...

Machine Learning Electronic Health Records Humans Uterine Cervical Neoplasms Natural Language Processing Female Data Curation Sweden

View on PubMed

Scaling Out and Evaluation of OBSecAn, an Automated Section Annotator for Semi-Structured Clinical Documents, on a Large VA Clinical Corpus.

AMIA ... Annual Symposium proceedings. AMIA Symposium Nov 5, 2015

"Identifying and labeling" (annotating) sections improves the effectiveness of extracting information stored in the free text of clinical documents. OBSecAn, an automated ontology-based section annotator, was developed to identify and label sections ...

Data Curation Reference Books, Medical Natural Language Processing United States Department of Veterans Affairs Humans Algorithms United States

View on PubMed

SORTA: a system for ontology-based re-coding and technical annotation of biomedical phenotype data.

Database : the journal of biological databases and curation Sep 18, 2015

There is an urgent need to standardize the semantics of biomedical data values, such as phenotypes, to enable comparative and integrative analyses. However, it is unlikely that all studies will use the same data collection protocols. As a result, ret...

Knowledge Bases Databases, Factual Humans Data Curation Software Biological Ontologies Animals

View on PubMed DOI

Using distant supervised learning to identify protein subcellular localizations from full-text scientific articles.

Journal of biomedical informatics Jul 26, 2015

Databases of curated biomedical knowledge, such as the protein-locations reflected in the UniProtKB database, provide an accurate and useful resource to researchers and decision makers. Our goal is to augment the manual efforts currently used to cura...

Supervised Machine Learning Knowledge Bases Data Curation Proteins Databases, Protein

View on PubMed DOI

Moving the mountain: analysis of the effort required to transform comparative anatomy into computable anatomy.

Database : the journal of biological databases and curation May 13, 2015

The diverse phenotypes of living organisms have been described for centuries, and though they may be digitized, they are not readily available in a computable form. Using over 100 morphological studies, the Phenoscape project has demonstrated that by...

Natural Language Processing Data Mining Databases, Factual Humans Animals Biological Ontologies Anatomy, Comparative Data Curation

View on PubMed DOI

The Confidence Information Ontology: a step towards a standard for asserting confidence in annotations.

Database : the journal of biological databases and curation May 9, 2015

Biocuration has become a cornerstone for analyses in biology, and to meet needs, the amount of annotations has considerably grown in recent years. However, the reliability of these annotations varies; it has thus become necessary to be able to assess...

Data Curation Congresses as Topic Biological Ontologies

View on PubMed DOI

Ontology application and use at the ENCODE DCC.

Database : the journal of biological databases and curation Mar 16, 2015

The Encyclopedia of DNA elements (ENCODE) project is an ongoing collaborative effort to create a catalog of genomic annotations. To date, the project has generated over 4000 experiments across more than 350 cell lines and tissues using a wide array o...

Gene Regulatory Networks Mice Databases, Genetic Molecular Sequence Annotation Humans Data Curation Animals Transcription, Genetic Gene Ontology

View on PubMed DOI

Shared resources, shared costs--leveraging biocuration resources.

Database : the journal of biological databases and curation Mar 16, 2015

The manual curation of the information in biomedical resources is an expensive task. This article argues the value of this approach in comparison with other apparently less costly options, such as automated annotation or text-mining, then discusses w...

Data Mining Databases, Genetic Gene Ontology Data Curation

View on PubMed DOI

mycoCLAP, the database for characterized lignocellulose-active proteins of fungal origin: resource and text mining curation support.

Database : the journal of biological databases and curation Mar 8, 2015

Enzymes active on components of lignocellulosic biomass are used for industrial applications ranging from food processing to biofuels production. These include a diverse array of glycoside hydrolases, carbohydrate esterases, polysaccharide lyases and...

Data Curation Enzymes Fungal Proteins Genes, Fungal Data Mining Lignin Databases, Genetic Natural Language Processing

View on PubMed DOI

AIMC Topic: Data Curation

A Study of Neural Word Embeddings for Named Entity Recognition in Clinical Text.

Finding Cervical Cancer Symptoms in Swedish Clinical Text using a Machine Learning Approach and NegEx.

Scaling Out and Evaluation of OBSecAn, an Automated Section Annotator for Semi-Structured Clinical Documents, on a Large VA Clinical Corpus.

SORTA: a system for ontology-based re-coding and technical annotation of biomedical phenotype data.

Using distant supervised learning to identify protein subcellular localizations from full-text scientific articles.

Moving the mountain: analysis of the effort required to transform comparative anatomy into computable anatomy.

The Confidence Information Ontology: a step towards a standard for asserting confidence in annotations.

Ontology application and use at the ENCODE DCC.

Shared resources, shared costs--leveraging biocuration resources.

mycoCLAP, the database for characterized lignocellulose-active proteins of fungal origin: resource and text mining curation support.

Popular Topics

Recent Journals

AIMC Topic: Data Curation

Don't Miss the Future of Medicine

Popular Topics

Recent Journals