MAGPIE: accurate pathogenic prediction for multiple variant types using machine learning approach.

Journal: Genome medicine
PMID:

Abstract

Identifying pathogenic variants from the vast majority of nucleotide variation remains a challenge. We present a method named Multimodal Annotation Generated Pathogenic Impact Evaluator (MAGPIE) that predicts the pathogenicity of multi-type variants. MAGPIE uses the ClinVar dataset for training and demonstrates superior performance in both the independent test set and multiple orthogonal validation datasets, accurately predicting variant pathogenicity. Notably, MAGPIE performs best in predicting the pathogenicity of rare variants and highly imbalanced datasets. Overall, results underline the robustness of MAGPIE as a valuable tool for predicting pathogenicity in various types of human genome variations. MAGPIE is available at https://github.com/shenlab-genomics/magpie .

Authors

  • Yicheng Liu
    Zhejiang Sci-Tech University, Hangzhou, Zhejiang, China.
  • Tianyun Zhang
  • Ningyuan You
    Department of Hepatobiliary and Pancreatic Surgery, First Affiliated > Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 311121, China.
  • Sai Wu
    Department of Hepatobiliary and Pancreatic Surgery, First Affiliated > Hospital & Liangzhu Laboratory, Zhejiang University School of Medicine, Hangzhou, 311121, China. wusai@zju.edu.cn.
  • Ning Shen
    State Key Laboratory on Integrated Optoelectronics, College of Electronic Science and Engineering, Jilin University, 130012 Changchun, China. Electronic address: shenning17@mails.jlu.edu.cn.