Genotypic-phenotypic landscape computation based on first principle and deep learning.

Journal: Briefings in bioinformatics
Published Date:

Abstract

The relationship between genotype and fitness is fundamental to evolution, but quantitatively mapping genotypes to fitness has remained challenging. We propose the Phenotypic-Embedding theorem (P-E theorem) that bridges genotype-phenotype through an encoder-decoder deep learning framework. Inspired by this, we proposed a more general first principle for correlating genotype-phenotype, and the P-E theorem provides a computable basis for the application of first principle. As an application example of the P-E theorem, we developed the Co-attention based Transformer model to bridge Genotype and Fitness model, a Transformer-based pre-train foundation model with downstream supervised fine-tuning that can accurately simulate the neutral evolution of viruses and predict immune escape mutations. Accordingly, following the calculation path of the P-E theorem, we accurately obtained the basic reproduction number (${R}_0$) of SARS-CoV-2 from first principles, quantitatively linked immune escape to viral fitness and plotted the genotype-fitness landscape. The theoretical system we established provides a general and interpretable method to construct genotype-phenotype landscapes, providing a new paradigm for studying theoretical and computational biology.

Authors

  • Yuexing Liu
    Department of Endocrinology and Metabolism, Shanghai Jiao Tong University Affiliated Sixth People's Hospital, Shanghai Diabetes Institute, Shanghai Clinical Center for Diabetes, Shanghai, 200233, China.
  • Yao Luo
    Food Inspection & Quarantine Center, Shenzhen Customs, Shenzhen 518045, China.
  • Xin Lu
    CAS Key Laboratory of Separation Sciences for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian 116023, China.
  • Hao Gao
    Institute of Pharmaceutical Analysis , College of Pharmacy , Jinan University , Guangzhou , Guangdong 510632 , China . Email: haibo.zhou@jnu.edu.cn ; Email: jzjjackson@hotmail.com ; Email: tghao@jnu.edu.cn.
  • Ruikun He
    Shanghai Institute of Nutrition and Health, Chinese Academy of Sciences, Shanghai 200030, China.
  • Xin Zhang
    First Department of Infectious Diseases, The First Affiliated Hospital of China Medical University, Shenyang, China.
  • Xuguang Zhang
    Mengniu Institute of Nutrition Science, Shanghai 200126, China.
  • Yixue Li