Discovery of unconventional and nonintuitive self-assembling peptide materials using experiment-driven machine learning.
Journal:
Science advances
Published Date:
Jun 13, 2025
Abstract
Prediction of peptide secondary structure is challenging because of complex molecular interactions, sequence-specific behavior, and environmental factors. Traditional design strategies, based on hydrophobicity and structural propensity, can be biased and could indeed prevent discovery of interesting, diverse, and unconventional peptides with desired nanostructure assembly. Using β sheet formation in pentapeptides as a case study, we used an integrated high-throughput experimental workflow and an artificial intelligence-driven active learning framework to improve prediction accuracy of self-assembly. By focusing on sequences where machine learning (ML) predictions deviate from conventional design strategies, we synthesized and tested 268 pentapeptides, successfully finding 96 forming β sheet assemblies, including unconventional sequences (e.g., ILFSM, LMISI, MITIY, MISIW, and WKIYI) not predicted by traditional methods. Our ML models outperformed conventional β sheet propensity tables, revealing useful chemical design rules. A web interface is provided to facilitate community access to these models. This work highlights the value of ML-driven approaches in overcoming the limitations of current peptide design strategies.