Towards holistic phenotype prediction beyond genotypic data.

Journal: Journal of Experimental Botany

Abstract

Genomic Selection (GS) has revolutionised breeding programs by enabling the prediction of phenotypes from genetic data. However, GS often explains only a portion of the phenotypic variation. This review explores the potential of integrating data types beyond genomics to improve phenotype prediction. It groups data integration strategies into five categories: eliminate, facilitate, aggregate, incorporate, and modulate. Eliminating methods remove the effect of non-genomic data, such as environmental data, on the phenotype. Facilitating methods leverage non-genomic data to improve the accuracy of GS models. Aggregating approaches combine different data types for analysis, potentially revealing components of variation not captured by individual data sources. Incorporating methods explicitly model interactions between data types. Modulating methods transform data into formats suitable for advanced models such as deep learning convolutional neural networks (CNNs). The review discusses the advantages and limitations of each strategy and provides an overview of the current state of the field. It concludes by emphasising the prospects of multi-data phenotypic prediction for developing a holistic prediction approach that supports a more comprehensive understanding of complex biological systems and significantly enhances prediction accuracy.
