Traversing chemical space with active deep learning for low-data drug discovery.

Journal: Nature computational science

Published Date: Sep 27, 2024

Abstract

Deep learning is accelerating drug discovery. However, current approaches are often affected by limitations in the available data, in terms of either size or molecular diversity. Active deep learning has high potential for low-data drug discovery, as it allows iterative model improvement during the screening process. However, there are several 'known unknowns' that limit the wider adoption of active deep learning in drug discovery: (1) what the best computational strategies are for chemical space exploration, (2) how active learning holds up to traditional, non-iterative, approaches and (3) how it should be used in the low-data scenarios typical of drug discovery. To provide answers, this study simulates a low-data drug discovery scenario, and systematically analyzes six active learning strategies combined with two deep learning architectures, on three large-scale molecular libraries. We identify the most important determinants of success in low-data regimes and show that active learning can achieve up to a sixfold improvement in hit discovery when compared with traditional screening methods.

Authors

Derek van Tilborg

Institute for Complex Molecular Systems and Dept. Biomedical Engineering, Eindhoven University of Technology, 5612AZEindhoven, The Netherlands.
Francesca Grisoni

Department of Chemistry and Applied Biosciences, Swiss Federal Institute of Technology (ETH), Vladimir-Prelog-Weg 4, CH-, 8093, Zurich, Switzerland.

Keywords

Deep Learning Drug Discovery Humans Small Molecule Libraries

External Resources

View on PubMed Access via DOI PubMed (39333789)

Traversing chemical space with active deep learning for low-data drug discovery.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals