Genotype sampling for deep-learning assisted experimental mapping of a combinatorially complete fitness landscape.

Journal: Bioinformatics (Oxford, England)
Published Date:

Abstract

MOTIVATION: Experimental characterization of fitness landscapes, which map genotypes onto fitness, is important for both evolutionary biology and protein engineering. It faces a fundamental obstacle in the astronomical number of genotypes whose fitness needs to be measured for any one protein. Deep learning may help to predict the fitness of many genotypes from a smaller neural network training sample of genotypes with experimentally measured fitness. Here I use a recently published experimentally mapped fitness landscape of more than 260 000 protein genotypes to ask how such sampling is best performed.

Authors

  • Andreas Wagner