AIMC Topic: Data Curation

Clear Filters Showing 131 to 140 of 143 articles

Probabilistic and machine learning-based retrieval approaches for biomedical dataset retrieval.

Database : the journal of biological databases and curation
The bioCADDIE dataset retrieval challenge brought together different approaches to retrieval of biomedical datasets relevant to a user’s query, expressed as a text description of a needed dataset. We describe experiments in applying a data-driven, ma...

YummyData: providing high-quality open life science data.

Database : the journal of biological databases and curation
Many life science datasets are now available via Linked Data technologies, meaning that they are represented in a common format (the Resource Description Framework), and are accessible via standard APIs (SPARQL endpoints). While this is an important ...

Identification of errors in the IEDB using ontologies.

Database : the journal of biological databases and curation
The Immune Epitope Database (IEDB) is a free online resource that has manually curated over 18 500 references from the scientific literature. Our database presents experimental data relating to the recognition of immune epitopes by the adaptive immun...

FAIR principles and the IEDB: short-term improvements and a long-term vision of OBO-foundry mediated machine-actionable interoperability.

Database : the journal of biological databases and curation
The Immune Epitope Database (IEDB), at www.iedb.org, has the mission to make published experimental data relating to the recognition of immune epitopes easily available to the scientific public. By presenting curated data in a searchable database, we...

Automatic Annotation of French Medical Narratives with SNOMED CT Concepts.

Studies in health technology and informatics
Medical data is multimodal. In particular, it is composed of both structured data and narrative data (free text). Narrative data is a type of unstructured data that, although containing valuable semantic and conceptual information, is rarely reused. ...

Inter-Annotator Agreement and the Upper Limit on Machine Performance: Evidence from Biomedical Natural Language Processing.

Studies in health technology and informatics
Human-annotated data is a fundamental part of natural language processing system development and evaluation. The quality of that data is typically assessed by calculating the agreement between the annotators. It is widely assumed that this agreement ...

Non-Visually Performing Analytical Tasks on Statistical Charts.

Studies in health technology and informatics
This article proposes a natural language-based approach to accessibility of charts. Formal underpinnings are used to semantically annotate the constituent elements of a vector graphic to support accessing and modifying the content by natural language...

The Evidence and Conclusion Ontology (ECO): Supporting GO Annotations.

Methods in molecular biology (Clifton, N.J.)
The Evidence and Conclusion Ontology (ECO) is a community resource for describing the various types of evidence that are generated during the course of a scientific study and which are typically used to support assertions made by researchers. ECO des...