Bayesian optimization and machine learning for vaccine formulation development.

Journal: PloS one
Published Date:

Abstract

Developing vaccines with a better stability is an area of improvement to meet the global health needs of preventing infectious diseases. With the advancement of data science and artificial intelligence, innovative approaches have emerged. This manuscript highlights the applications of machine learning through two cases in which Bayesian optimization was used to develop viral vaccine formulations. The two case studies monitored the critical quality attributes of virus A in liquid form by infectious titer loss and virus B in freeze-dried form by glass transition temperature. Stepwise analysis and model optimization demonstrated progressive improvements of model quality and prediction accuracy. The cross-validation matrices of the models' predictions showed high R² and low root mean square errors, indicating their reliability. The prediction accuracy of models was further validated by using test datasets. Model analysis using prediction error plot, Shapeley Additive exPlanations, permutation importance, etc. can provide additional insights into relations between model and experimental design, the influence of features of interest, and non-linear responses. Overall, Bayesian optimization is a useful complementary tool in formulation development that can help scientists make effective data-driven decisions.

Authors

  • Lillian Li
    Vaccine CMC Development & Supply, Sanofi, Toronto, Ontario, Canada.
  • Sung-In Back
    Vaccine CMC Development & Supply, Sanofi, Toronto, Ontario, Canada.
  • Jian Ma
  • Yawen Guo
    Vaccine CMC Development & Supply, Sanofi, Marcy-L' Etoile, France.
  • Thomas Galeandro-Diamant
    ChemAI Ltd. West Hill House, Allerton Hill, Chapel Allerton, Leeds, United Kingdom.
  • Didier Clénet
    Vaccine CMC Development & Supply, Sanofi, Marcy-L' Etoile, France.