AI driven cardiovascular risk prediction using NLP and Large Language Models for personalized medicine in athletes.
Journal:
SLAS technology
Published Date:
Apr 10, 2025
Abstract
The performance and long-term health of athletes are significantly influenced by their cardiovascular resilience and associated risk factors. This study explores the innovative applications of Natural Language Processing (NLP) and Large Language Models (LLMs) in biomedical diagnostics, particularly for AI-driven arrhythmia detection, hypertrophic cardiomyopathy (HCM) in athletes, and personalized medicine. The complexity of analysing diverse biomedical datasets, such as electrocardiograms (ECG), clinical records, genetic screening reports, and imaging results, poses challenges in obtaining precise early diagnoses. To address these issues, we introduce a hybrid machine learning (ML) framework that integrates the Wolf Pack Search Algorithm Dynamic Random Forest (WPSA-DRF) with a RoBERTa-based LLM to enhance the accuracy of cardiovascular disease predictions. Using advanced NLP techniques, including biomedical text mining, entity recognition, and feature extraction, the system processes structured and unstructured clinical data to detect abnormalities associated with sudden cardiac arrest (SCA), arrhythmias, and genetic cardiomyopathies. The proposed system achieves a diagnostic accuracy of 92.5 %, precision of 92.7 %, recall of 99.23 %, and F1-score of 95.6 %, outperforming traditional diagnostic methodologies. Furthermore, the research underscores the role of LLMs in personalized medicine, identifying patient-specific risk factors and optimizing treatment pathways for cardiac patients. This work highlights how NLP-driven AI solutions are transforming biomedical research, accelerating early disease detection, and improving clinical decision-making for both athletes and the general population.