Critical Assessment of RNA and DNA Structure Predictions via Artificial Intelligence: The Imitation Game.
Journal: Journal of Chemical Information and Modeling
PMID: 40159092
Abstract
Computational predictions of biomolecular structure via artificial intelligence (AI) based approaches, as exemplified by AlphaFold software, have the potential to model all of life's biomolecules. We performed oligonucleotide structure prediction and gauged the accuracy of the AI-generated models via their agreement with experimental solution-state observables. We find parts of these models in good agreement with experimental data, and others falling short of the ground truth. The latter include internal or capping loops, noncanonical base pairings, and regions involving conformational flexibility, all essential for RNA folding, interactions, and function. We estimate root-mean-square (r.m.s.) errors in predicted nucleotide bond vector orientations ranging between 7° and 30°, with higher accuracies for simpler architectures of individual canonically paired helical stems. These mixed results highlight the necessity of experimental validation of AI-based oligonucleotide model predictions and their current tendency to mimic the training data set rather than reproduce the underlying reality.
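To make the r.m.s. orientation error concrete, the following minimal Python sketch computes a root-mean-square angular deviation between two sets of bond vectors. It is an illustration only and not the authors' procedure: the abstract states that errors were estimated from agreement with experimental solution-state observables, whereas this sketch assumes that reference bond vectors are directly available. The function names `angle_between_deg` and `rms_angular_error_deg` and the example vectors are hypothetical.

```python
import math


def angle_between_deg(u, v):
    """Angle in degrees between two 3D vectors (hypothetical helper)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("bond vectors must be non-zero")
    # Clamp to [-1, 1] to guard against floating-point round-off.
    cosine = max(-1.0, min(1.0, dot / (norm_u * norm_v)))
    return math.degrees(math.acos(cosine))


def rms_angular_error_deg(predicted, reference):
    """R.m.s. of the per-bond angles between predicted and reference vectors.

    Assumes both lists are in the same order and the same coordinate frame
    (i.e. the two structures were already superimposed).
    """
    if len(predicted) != len(reference) or not predicted:
        raise ValueError("need two non-empty lists of equal length")
    squared = [angle_between_deg(p, r) ** 2 for p, r in zip(predicted, reference)]
    return math.sqrt(sum(squared) / len(squared))


if __name__ == "__main__":
    # Made-up example: two reference bond vectors and predictions that are
    # each rotated by 10 degrees about the z axis.
    t = math.radians(10.0)
    reference = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
    predicted = [(math.cos(t), math.sin(t), 0.0), (-math.sin(t), math.cos(t), 0.0)]
    print(f"r.m.s. angular error: {rms_angular_error_deg(predicted, reference):.1f} degrees")
```

On this scale, a value near 7° would correspond to the best-predicted regions reported in the abstract (canonically paired helical stems) and a value near 30° to the worst.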