A machine learning approach based on ACMG/AMP guidelines for genomic variant classification and prioritization.

Journal: Scientific reports
Published Date:

Abstract

Genomic variant interpretation is a critical step of the diagnostic procedure, often supported by the application of tools that may predict the damaging impact of each variant or provide a guidelines-based classification. We propose the application of Machine Learning methodologies, in particular Penalized Logistic Regression, to support variant classification and prioritization. Our approach combines ACMG/AMP guidelines for germline variant interpretation as well as variant annotation features and provides a probabilistic score of pathogenicity, thus supporting the prioritization and classification of variants that would be interpreted as uncertain by the ACMG/AMP guidelines. We compared different approaches in terms of variant prioritization and classification on different datasets, showing that our data-driven approach is able to solve more variant of uncertain significance (VUS) cases in comparison with guidelines-based approaches and in silico prediction tools.

Authors

  • Giovanna Nicora
    Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy.
  • Susanna Zucca
    enGenome S.R.L., Pavia, Italy.
  • Ivan Limongelli
    IRCCS Policlinico S. Matteo, Pzz.le Volontari del Sangue 2, 27100, Pavia, Italy. ivan.limongelli@unipv.it.
  • Riccardo Bellazzi
    Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy.
  • Paolo Magni
    Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy. paolo.magni@unipv.it.