AIMC Topic: Data Curation

Clear Filters Showing 101 to 110 of 142 articles

Annotating neurophysiologic data at scale with optimized human input.

Journal of neural engineering
Neuroscience experiments and devices are generating unprecedented volumes of data, but analyzing and validating them presents practical challenges, particularly in annotation. While expert annotation remains the gold standard, it is time consuming to...

The Evolution of Radiology Image Annotation in the Era of Large Language Models.

Radiology. Artificial intelligence
Although there are relatively few diverse, high-quality medical imaging datasets on which to train computer vision artificial intelligence models, even fewer datasets contain expertly classified observations that can be repurposed to train or test su...

Assessing the performance of generative artificial intelligence in retrieving information against manually curated genetic and genomic data.

Database : the journal of biological databases and curation
Curated resources at centralized repositories provide high-value service to users by enhancing data veracity. Curation, however, comes with a cost, as it requires dedicated time and effort from personnel with deep domain knowledge. In this paper, we ...

Automated Workflows for Data Curation and Machine Learning to Develop Quantitative Structure-Activity Relationships.

Methods in molecular biology (Clifton, N.J.)
The recent advancements in machine learning and the new availability of large chemical datasets made the development of tools and protocols for computational chemistry a topic of high interest. In this chapter a standard procedure to develop Quantita...

Automated annotation of scientific texts for ML-based keyphrase extraction and validation.

Database : the journal of biological databases and curation
Advanced omics technologies and facilities generate a wealth of valuable data daily; however, the data often lack the essential metadata required for researchers to find, curate, and search them effectively. The lack of metadata poses a significant c...

DUVEL: an active-learning annotated biomedical corpus for the recognition of oligogenic combinations.

Database : the journal of biological databases and curation
While biomedical relation extraction (bioRE) datasets have been instrumental in the development of methods to support biocuration of single variants from texts, no datasets are currently available for the extraction of digenic or even oligogenic vari...

Effective and efficient active learning for deep learning-based tissue image analysis.

Bioinformatics (Oxford, England)
MOTIVATION: Deep learning attained excellent results in digital pathology recently. A challenge with its use is that high quality, representative training datasets are required to build robust models. Data annotation in the domain is labor intensive ...

CARD 2023: expanded curation, support for machine learning, and resistome prediction at the Comprehensive Antibiotic Resistance Database.

Nucleic acids research
The Comprehensive Antibiotic Resistance Database (CARD; card.mcmaster.ca) combines the Antibiotic Resistance Ontology (ARO) with curated AMR gene (ARG) sequences and resistance-conferring mutations to provide an informatics framework for annotation a...

The reactome pathway knowledgebase 2022.

Nucleic acids research
The Reactome Knowledgebase (https://reactome.org), an Elixir core resource, provides manually curated molecular details across a broad range of physiological and pathological biological processes in humans, including both hereditary and acquired dise...