Accelerating the optimization of enzyme-catalyzed synthesis conditions machine learning and reactivity descriptors.

Journal: Organic & biomolecular chemistry
Published Date:

Abstract

Enzyme-catalyzed synthesis reactions are of crucial importance for a wide range of applications. An accurate and rapid selection of optimal synthesis conditions is crucial and challenging for both human knowledge and computer predictions. In this work, a new scenario, which combines a data-driven machine learning (ML) model with reactivity descriptors, is developed to predict the optimal enzyme-catalyzed synthesis conditions and the reaction yield. Fourteen reactivity descriptors in total are constructed to describe 125 reactions (classified into five categories) included in different reaction mechanisms. Nineteen ML models are developed to train the dataset and the Quadratic support vector machine (SVM) model is found to exhibit the best performance. The Quadratic SVM model is then used to predict the optimal reaction conditions, which are subsequently used to obtain the highest yield among 109 200 reaction conditions with different molar ratios of substrates, solvents, water contents, enzyme concentrations and temperatures for each reaction. The proposed protocol should be generally applicable to a diverse range of chemical reactions and provides a black-box evaluation for optimizing the reaction conditions of organic synthesis reactions.

Authors

  • Zhongyu Wan
    Jiangsu Key Laboratory of Coal-based Greenhouse Gas Control and Utilization, Low Carbon Energy Institute and School of Chemical Engineering, China University of Mining and Technology, Xuzhou, 221008, People's Republic of China. quandewang@cumt.edu.cn zhongyuwanxzit@163.com and School of Science, City University of Hong Kong, Hong Kong SAR 999077, People's Republic of China.
  • Quan-De Wang
    Jiangsu Key Laboratory of Coal-based Greenhouse Gas Control and Utilization, Low Carbon Energy Institute and School of Chemical Engineering, China University of Mining and Technology, Xuzhou, 221008, People's Republic of China. quandewang@cumt.edu.cn zhongyuwanxzit@163.com.
  • Dongchang Liu
    School of Science, Xi'an Polytechnic University, Xi'an 710048, People's Republic of China and Department of Physics, Sungkyunkwan University, Suwon 16419, Korea.
  • Jinhu Liang
    School of Environment and Safety Engineering, North University of China, Taiyuan 030051, People's Republic of China.