AIMC Topic: Data Mining

Clear Filters Showing 1201 to 1210 of 1626 articles

Constrained Tensor Factorization for Cancer Phenotyping and Mortality Prediction.

Studies in health technology and informatics
Electronic health records (EHR) enable machine learning methods like tensor factorization to extract computational phenotypes. Using Northwestern Medicine data (2000-2015), we analyzed breast, prostate, colorectal, and lung cancer cohorts to predict ...

Beyond GPT-NER: ChatGPT as Ensemble Arbitrator for Discontinuous Named Entity Recognition in Health Corpora.

Studies in health technology and informatics
In medicine and healthcare, NER (Named Entity Recognition) involves identifying clinically relevant entities such as medications, symptoms, and adverse drug events (ADEs). This task is particularly challenging due to discontinuous NER (DNER), fragmen...

The Best of All Worlds: A Hybrid Approach to Cohort Identification with Rules, Small and Large Language Models.

Studies in health technology and informatics
Balancing operational feasibility with the performance of natural language processing (NLP) systems is a significant challenge. This study presents a hybrid strategy to integrate manually curated rules, small language model (SLM), and large language ...

Optimizing Entity Recognition in Psychiatric Treatment Data with Large Language Models.

Studies in health technology and informatics
Extracting nuanced adverse drug reactions (ADRs) from patient self-reported messages using is pivotal but challenging, particularly given HIPAA constraints. We investigate locally deployable small LLMs-Mistral-7B, Llama-3-8B, and Gemma-7B-for ADR ext...

Development and Evaluation of Natural Language Processing Methods for Extracting Key Melanoma Pathology Concepts.

Studies in health technology and informatics
This study presents the development and evaluation of an annotation schema and rule-based natural language processing (NLP) system for extracting key melanoma pathology concepts from surgical pathology reports. Achieving high precision and recall, ou...

Monitoring Over-The-Counter Drug Misuse in Japanese User-Generated Data.

Studies in health technology and informatics
INTRODUCTION: The misuse of over-the-counter (OTC) drugs poses a significant global public health challenge. This study proposes a system for detecting and visualizing inappropriate OTC drug use in social media data.

EvidenceOutcomes: A Dataset of Clinical Trial Publications with Clinically Meaningful Outcomes.

Studies in health technology and informatics
The fundamental process of evidence extraction in evidence-based medicine relies on identifying PICO elements, with Outcomes being the most complex and often overlooked. To address this, we introduce EvidenceOutcomes, a large annotated corpus of clin...

PheCatcher: Leveraging LLM-Generated Synthetic Data for Automated Phenotype Definition Extraction from Biomedical Literature.

Studies in health technology and informatics
Phenotype definitions are crucial for the progression of precision and personalized medicine. Although phenotype knowledge bases such as PheKB and the OHDSI library are available, they rely heavily on manual input. This study introduces PheCatcher, a...

A Framework for Extracting, and Validating Named-Entities to Integrate Openehr Using the Example of Free Text Molecular Genetic Findings.

Studies in health technology and informatics
Processing and extracting information from unstructured texts written by physicians in Hospitals is still an open problem. There is no efficient solution that ensures the reliability of the extracted information without any human intervention. Many f...

Natural Language Processing-Based Approach to Detect Common Adverse Events of Anticancer Agents from Unstructured Clinical Notes: A Time-to-Event Analysis.

Studies in health technology and informatics
This study assessed the effectiveness of natural language processing (NLP) in detecting adverse events (AEs) from anticancer agents by analyzing data from over 39,000 cancer patients. A specialized machine learning model identified known AEs from ant...