Explainable machine learning for neoplasms diagnosis via electrocardiograms: an externally validated study
Journal:
arXiv
Published Date:
Dec 10, 2024
Abstract
Background: Neoplasms remains a leading cause of mortality worldwide, with
timely diagnosis being crucial for improving patient outcomes. Current
diagnostic methods are often invasive, costly, and inaccessible to many
populations. Electrocardiogram (ECG) data, widely available and non-invasive,
has the potential to serve as a tool for neoplasms diagnosis by using
physiological changes in cardiovascular function associated with neoplastic
prescences.
Methods: This study explores the application of machine learning models to
analyze ECG features for the diagnosis of neoplasms. We developed a pipeline
integrating tree-based models with Shapley values for explainability. The model
was trained and internally validated and externally validated on a second
large-scale independent external cohort to ensure robustness and
generalizability.
Findings: The results demonstrate that ECG data can effectively capture
neoplasms-associated cardiovascular changes, achieving high performance in both
internal testing and external validation cohorts. Shapley values identified key
ECG features influencing model predictions, revealing established and novel
cardiovascular markers linked to neoplastic conditions. This non-invasive
approach provides a cost-effective and scalable alternative for the diagnosis
of neoplasms, particularly in resource-limited settings. Similarly, useful for
the management of secondary cardiovascular effects given neoplasms therapies.
Interpretation: This study highlights the feasibility of leveraging ECG
signals and machine learning to enhance neoplasms diagnostics. By offering
interpretable insights into cardio-neoplasms interactions, this approach
bridges existing gaps in non-invasive diagnostics and has implications for
integrating ECG-based tools into broader neoplasms diagnostic frameworks, as
well as neoplasms therapy management.