Deep reinforcement learning enables better bias control in benchmark for virtual screening.

Journal: Computers in biology and medicine

Published Date: Feb 15, 2024

Abstract

Virtual screening (VS) has been incorporated into the paradigm of modern drug discovery. This field is now undergoing a new wave of revolution driven by artificial intelligence and more specifically, machine learning (ML). In terms of those out-of-the-box datasets for model training or benchmarking, their data volume and applicability domain are limited. They are suffering from the biases constantly reported in the ML application. To address these issues, we present a novel benchmark named MUBD. The utilization of synthetic decoys (i.e., presumed inactives) is the main feature of MUBD, where deep reinforcement learning was leveraged for bias control during decoy generation. Then, we carried out extensive validations on this new benchmark. First, we confirmed that MUBD was superior to the classical benchmarks in control of domain bias, artificial enrichment bias and analogue bias. Moreover, we found that the assessment of ML models based on MUBD was less biased as revealed by the analysis of asymmetric validation embedding bias. In addition, MUBD showed better setting of benchmarking challenge for deep learning models compared with NRLiSt-BDB. Overall, we have proven that MUBD is the close-to-ideal benchmark for VS. The computational tool is publicly available for the easy extension of MUBD.

Authors

Tao Shen

Shanghai Chenpon Pharmaceutical Co., Ltd., Shanghai, China.
Shan Li

College of Mathematics and Physics, Qingdao University of Science and Technology, Qingdao 266061, China; Artificial Intelligence and Biomedical Big Data Research Center, Qingdao University of Science and Technology, Qingdao 266061, China. Electronic address: lishan5600@163.com.
Xiang Simon Wang

Artificial Intelligence and Drug Discovery Core Laboratory for District of Columbia Center for AIDS Research (DC CFAR), Department of Pharmaceutical Sciences, College of Pharmacy, Howard University, U.S.A.
Dongmei Wang

Department of Gastrointestinal Surgery, Affiliated Changzhou No. 2 People's Hospital of Nanjing Medical University, The Third Affiliated Hospital of Nanjing Medical University, Changzhou Medical Center, Nanjing Medical University, No. 68 Gehu Road, Wujin District, Changzhou City, 213000, Jiangsu, China. dongmeiwang0526@163.com.
Song Wu

National Engineering Research Center for Big Data Technology and System, Services Computing Technology and System Lab, Cluster and Grid Computing Lab, School of Computer Science and Technology, Huazhong University of Science and Technology, Wuhan, China.
Jie Xia

Department of Respiratory and Critical Care Medicine, Tongji Hospital, Tongji Medical College, Huazhong University of Sciences & Technology, Wuhan, People's Republic of China.
Liangren Zhang

State Key Laboratory of Natural and Biomimetic Drugs, School of Pharmaceutical Sciences, Peking University, 100191 Beijing, P. R. China.

Keywords

Artificial Intelligence Bias Drug Discovery Machine Learning

External Resources

View on PubMed Access via DOI PubMed (38402838)

Deep reinforcement learning enables better bias control in benchmark for virtual screening.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals