AIGen: an artificial intelligence software for complex genetic data analysis.

Journal: Briefings in bioinformatics
Published Date:

Abstract

The recent development of artificial intelligence (AI) technology, especially the advance of deep neural network (DNN) technology, has revolutionized many fields. While DNN plays a central role in modern AI technology, it has rarely been used in genetic data analysis due to analytical and computational challenges brought by high-dimensional genetic data and an increasing number of samples. To facilitate the use of AI in genetic data analysis, we developed a C++ package, AIGen, based on two newly developed neural networks (i.e. kernel neural networks and functional neural networks) that are capable of modeling complex genotype-phenotype relationships (e.g. interactions) while providing robust performance against high-dimensional genetic data. Moreover, computationally efficient algorithms (e.g. a minimum norm quadratic unbiased estimation approach and batch training) are implemented in the package to accelerate the computation, making them computationally efficient for analyzing large-scale datasets with thousands or even millions of samples. By applying AIGen to the UK Biobank dataset, we demonstrate that it can efficiently analyze large-scale genetic data, attain improved accuracy, and maintain robust performance. Availability: AIGen is developed in C++ and its source code, along with reference libraries, is publicly accessible on GitHub at https://github.com/TingtHou/AIGen.

Authors

  • Tingting Hou
    Department of Experimental Statistics, Louisiana State University, 45 Martin D. Woodin Hall, Baton Rouge, LA 70802, United States.
  • Xiaoxi Shen
    Texas State University, San Marcos, TX, USA.
  • Shan Zhang
  • Muxuan Liang
    Department of Statistics, University of Wisconsin-Madison, Madison, Wisconsin.
  • Li Chen
    Department of Endocrinology and Metabolism, Qilu Hospital, Shandong University, Jinan, China.
  • Qing Lu
    University of Florida, Gainesville, FL, USA.