Assessing and comparison of different machine learning methods in parent-offspring trios for genotype imputation.

Journal: Journal of theoretical biology
PMID:

Abstract

Genotype imputation is an important tool for prediction of unknown genotypes for both unrelated individuals and parent-offspring trios. Several imputation methods are available and can either employ universal machine learning methods, or deploy algorithms dedicated to infer missing genotypes. In this research the performance of eight machine learning methods: Support Vector Machine, K-Nearest Neighbors, Extreme Learning Machine, Radial Basis Function, Random Forest, AdaBoost, LogitBoost, and TotalBoost compared in terms of the imputation accuracy, computation time and the factors affecting imputation accuracy. The methods employed using real and simulated datasets to impute the un-typed SNPs in parent-offspring trios. The tested methods show that imputation of parent-offspring trios can be accurate. The Random Forest and Support Vector Machine were more accurate than the other machine learning methods. The TotalBoost performed slightly worse than the other methods.The running times were different between methods. The ELM was always most fast algorithm. In case of increasing the sample size, the RBF requires long imputation time.The tested methods in this research can be an alternative for imputation of un-typed SNPs in low missing rate of data. However, it is recommended that other machine learning methods to be used for imputation.

Authors

  • Abbas Mikhchi
    Department of Animal Science, Science and Research Branch, Islamic Azad University, Tehran, Iran. Electronic address: abbas_mikhchi@yahoo.com.
  • Mahmood Honarvar
    Department of Animal Science, Shahr-e-Qods Branch, Islamic Azad University, Tehran, Iran.
  • Nasser Emam Jomeh Kashan
    Department of Animal Science, Science and Research Branch, Islamic Azad University, Tehran, Iran.
  • Mehdi Aminafshar
    Department of Animal Science, Science and Research Branch, Islamic Azad University, Tehran, Iran.