AIMC Topic: Data Curation

Showing 1 to 10 of 142 articles

Utilizing Large language models to select literature for meta-analysis shows workload reduction while maintaining a similar recall level as manual curation.

BMC medical research methodology
BACKGROUND: Large language models (LLMs) like ChatGPT showed great potential in aiding medical research. A heavy workload in filtering records is needed during the research process of evidence-based medicine, especially meta-analysis. However, few st...

Data stewardship and curation practices in AI-based genomics and automated microscopy image analysis for high-throughput screening studies: promoting robust and ethical AI applications.

Human genomics
BACKGROUND: Researchers have increasingly adopted AI and next-generation sequencing (NGS), revolutionizing genomics and high-throughput screening (HTS), and transforming our understanding of cellular processes and disease mechanisms. However, these a...

Machine learning tools match physician accuracy in multilingual text annotation.

Scientific reports
In the medical field, text annotation involves categorizing clinical and biomedical texts with specific medical categories, enhancing the organization and interpretation of large volumes of unstructured data. This process is crucial for developing to...

Partial Annotation Learning for Biomedical Entity Recognition.

IEEE journal of biomedical and health informatics
Named Entity Recognition (NER) is a key task to support biomedical research. In Biomedical Named Entity Recognition (BioNER), obtaining high-quality expert annotated data is laborious and expensive, leading to the development of automatic approaches ...

Capturing Requirements for a Data Annotation Tool for Intensive Care: Experimental User-Centered Design Study.

JMIR human factors
BACKGROUND: Increasing use of computational methods in health care provides opportunities to address previously unsolvable problems. Machine learning techniques applied to routinely collected data can enhance clinical tools and improve patient outcom...

Active learning for extracting rare adverse events from electronic health records: A study in pediatric cardiology.

International journal of medical informatics
OBJECTIVE: Automate the extraction of adverse events from the text of electronic medical records of patients hospitalized for cardiac catheterization.

Influence of Data Curation and Confidence Levels on Compound Predictions Using Machine Learning Models.

Journal of chemical information and modeling
While data curation principles and practices are a major topic in data science, they are often not explicitly considered in machine learning (ML) applications in chemistry. We have been interested in evaluating the potential effects of data curation ...

Annotation Practices in Computational Pathology: A European Society of Digital and Integrative Pathology (ESDIP) Survey Study.

Laboratory investigation; a journal of technical methods and pathology
Integrating digital pathology and artificial intelligence (AI) algorithms can potentially improve diagnostic practice and precision medicine. Developing reliable, generalizable, and comparable AI algorithms depends on access to meticulously annotated...

Annotation of epilepsy clinic letters for natural language processing.

Journal of biomedical semantics
BACKGROUND: Natural language processing (NLP) is increasingly being used to extract structured information from unstructured text to assist clinical decision-making and aid healthcare research. The availability of expert-annotated documents for the d...

SeqImprove: Machine-Learning-Assisted Curation of Genetic Circuit Sequence Information.

ACS synthetic biology
The progress and utility of synthetic biology is currently hindered by the lengthy process of studying literature and replicating poorly documented work. Reconstruction of crucial design information through post hoc curation is highly noisy and error...