Kernel methods for large-scale genomic data analysis.

Journal: Briefings in bioinformatics
Published Date:

Abstract

Machine learning, particularly kernel methods, has been demonstrated as a promising new tool to tackle the challenges imposed by today's explosive data growth in genomics. They provide a practical and principled approach to learning how a large number of genetic variants are associated with complex phenotypes, to help reveal the complexity in the relationship between the genetic markers and the outcome of interest. In this review, we highlight the potential key role it will have in modern genomic data processing, especially with regard to integration with classical methods for gene prioritizing, prediction and data fusion.

Authors

  • Xuefeng Wang
    Department of Advanced Manufacturing and Robotics, College of Engineering, Peking University, Beijing 100871, China.
  • Eric P Xing
    Department of Machine Learning, Carnegie-Mellon University, Pittsburgh, PA 15213.
  • Daniel J Schaid