Machine Learning Methods for Classifying Multiple Sclerosis and Alzheimer's Disease Using Genomic Data.

Journal: International journal of molecular sciences
PMID:

Abstract

Complex diseases pose challenges in prediction due to their multifactorial and polygenic nature. This study employed machine learning (ML) to analyze genomic data from the UK Biobank, aiming to predict the genomic predisposition to complex diseases like multiple sclerosis (MS) and Alzheimer's disease (AD). We tested logistic regression (LR), ensemble tree methods, and deep learning models for this purpose. LR displayed remarkable stability across various subsets of data, outshining deep learning approaches, which showed greater variability in performance. Additionally, ML methods demonstrated an ability to maintain optimal performance despite correlated genomic features due to linkage disequilibrium. When comparing the performance of polygenic risk score (PRS) with ML methods, PRS consistently performed at an average level. By employing explainability tools in the ML models of MS, we found that the results confirmed the polygenicity of this disease. The highest-prioritized genomic variants in MS were identified as expression or splicing quantitative trait loci located in non-coding regions within or near genes associated with the immune response, with a prevalence of human leukocyte antigen (HLA) gene annotations. Our findings shed light on both the potential and the challenges of employing ML to capture complex genomic patterns, paving the way for improved predictive models.

Authors

  • Magdalena Arnal Segura
    Centre for Human Technologies, Istituto Italiano di Tecnologia, Via Enrico Melen, 83, 16152 Genova, Italy.
  • Giorgio Bini
    Centre for Human Technologies, Istituto Italiano di Tecnologia, Via Enrico Melen, 83, 16152 Genova, Italy.
  • Anastasia Krithara
    NCSR Demokritos, Ag. Paraskevi, Athens, 60228, Greece. akrithara@iit.demokritos.gr.
  • Georgios Paliouras
    Institute of Informatics and Telecommunications, NCSR Demokritos, Patr. Gregoriou E and 27 Neapoleos St, Athens, 15341, Greece.
  • Gian Gaetano Tartaglia
    Center for Human Technologies, Istituto Italiano di Tecnologia, Genoa, Italy. gian.tartaglia@iit.it.