miDruglikeness: Subdivisional Drug-Likeness Prediction Models Using Active Ensemble Learning Strategies.

Journal: Biomolecules
Published Date:

Abstract

The drug development pipeline involves several stages including in vitro assays, in vivo assays, and clinical trials. For candidate selection, it is important to consider that a compound will successfully pass through these stages. Using graph neural networks, we developed three subdivisional models to individually predict the capacity of a compound to enter in vivo testing, clinical trials, and market approval stages. Furthermore, we proposed a strategy combing both active learning and ensemble learning to improve the quality of the models. The models achieved satisfactory performance in the internal test datasets and four self-collected external test datasets. We also employed the models as a general index to make an evaluation on a widely known benchmark dataset DEKOIS 2.0, and surprisingly found a powerful ability on virtual screening tasks. Our model system (termed as miDruglikeness) provides a comprehensive drug-likeness prediction tool for drug discovery and development.

Authors

  • Chenjing Cai
    Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, 100871, PR China.
  • Haoyu Lin
    School of Mechatronics Engineering, Harbin Institute of Technology, Harbin 150001, China.
  • Hongyi Wang
    Tianjin Key Laboratory of Intelligent Control of Electrical Equipment, Tiangong University, Tianjin 300387, People's Republic of China.
  • Youjun Xu
    Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, ‡Beijing National Laboratory for Molecular Sciences, State Key Laboratory for Structural Chemistry of Unstable and Stable Species, College of Chemistry and Molecular Engineering, and ¶Peking-Tsinghua Center for Life Sciences, Peking University , Beijing 100871, China.
  • Qi Ouyang
    State Key Laboratory for Artificial Microstructure and Mesoscopic Physics, Institute of Condensed Matter Physics, School of Physics, Center for Quantitative Biology, Peking University, Beijing, China.
  • Luhua Lai
    Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China.
  • Jianfeng Pei
    Center for Quantitative Biology, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China.