AIMC Topic: Data Curation

Clear Filters Showing 21 to 30 of 142 articles

Semi-Automated Data Curation from Biomedical Literature.

AMIA ... Annual Symposium proceedings. AMIA Symposium
Data curation is a bottleneck for many informatics pipelines. A specific example of this is aggregating data from preclinical studies to identify novel genetic pathways for atherosclerosis in humans. This requires extracting data from published mouse...

Clinicians' perception of oral potentially malignant disorders: a pitfall for image annotation in supervised learning.

Oral surgery, oral medicine, oral pathology and oral radiology
OBJECTIVE: The present study aims to quantify clinicians' perceptions of oral potentially malignant disorders (OPMDs) when evaluating, classifying, and manually annotating clinical images, as well as to understand the source of inter-observer variabi...

KaIDA: a modular tool for assisting image annotation in deep learning.

Journal of integrative bioinformatics
Deep learning models achieve high-quality results in image processing. However, to robustly optimize parameters of deep neural networks, large annotated datasets are needed. Image annotation is often performed manually by experts without a comprehens...

An Efficient Semi-Supervised Framework with Multi-Task and Curriculum Learning for Medical Image Segmentation.

International journal of neural systems
A practical problem in supervised deep learning for medical image segmentation is the lack of labeled data which is expensive and time-consuming to acquire. In contrast, there is a considerable amount of unlabeled data available in the clinic. To mak...

Comparison of radiologist versus natural language processing-based image annotations for deep learning system for tuberculosis screening on chest radiographs.

Clinical imaging
Although natural language processing (NLP) can rapidly extract disease labels from radiology reports to create datasets for deep learning models, this may be less accurate than having radiologists manually review the images. In this study, we compare...

Self-Supervised Robust Feature Matching Pipeline for Teach and Repeat Navigation.

Sensors (Basel, Switzerland)
The performance of deep neural networks and the low costs of computational hardware has made computer vision a popular choice in many robotic systems. An attractive feature of deep-learned methods is their ability to cope with appearance changes caus...

Active label cleaning for improved dataset quality under resource constraints.

Nature communications
Imperfections in data annotation, known as label noise, are detrimental to the training of machine learning models and have a confounding effect on the assessment of model performance. Nevertheless, employing experts to remove label noise by fully re...

The Challenge of Data Annotation in Deep Learning-A Case Study on Whole Plant Corn Silage.

Sensors (Basel, Switzerland)
Recent advances in computer vision are primarily driven by the usage of deep learning, which is known to require large amounts of data, and creating datasets for this purpose is not a trivial task. Larger benchmark datasets often have detailed proces...

The BMS-LM ontology for biomedical data reporting throughout the lifecycle of a research study: From data model to ontology.

Journal of biomedical informatics
Biomedical research data reuse and sharing is essential for fostering research progress. To this aim, data producers need to master data management and reporting through standard and rich metadata, as encouraged by open data initiatives such as the F...

Tracking cell lineages in 3D by incremental deep learning.

eLife
Deep learning is emerging as a powerful approach for bioimage analysis. Its use in cell tracking is limited by the scarcity of annotated data for the training of deep-learning models. Moreover, annotation, training, prediction, and proofreading curre...