BalancedDiff: Balanced Diffusion Network for High-Quality Molecule Generation.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

Traditional drug discovery and development are time-consuming and expensive. Deep learning-based molecule generation techniques can reduce costs and improve efficiency, helping to generate high-quality molecules with desirable properties. However, existing deep learning-based methods focus on designing complex network structures to extract key features, which ignore the impact of sample bias and rarely take biochemical principles into account. To solve the above problems, a Balance Loss is proposed to balance sample bias. Second, we designed a KAN-based Balanced Feature Filtering (KBFF) module that balances molecular feature information with spatial location data, effectively filtering out unrelated groups. This approach ensures that the model considers both the chemical properties of functional groups and their spatial arrangements, minimizing noise while preserving critical biochemical relationships. By achieving this balance, the module improves the generated molecular quality. Besides, while diffusion models generate numerous molecules, their effectiveness and reliability remain uncertain, limiting their practical utility. To overcome this limitation, we introduce a QikProp module that predicts ADME properties, filtering out molecules with poor drug-like characteristics or potential safety risks, thereby enhancing the quality and applicability of generated molecules. Experiments on the CrossDocked2020 data set demonstrate the superiority of our method.

Authors

  • Yulong Wu
    Advanced Materials Division, Key Laboratory of Multifunctional Nanomaterials and Smart Systems, Suzhou Institute of Nano-Tech and Nano-Bionics, Chinese Academy of Sciences, Suzhou 215123, China.
  • Jin Xie
    School of Mathematics and Statistics, Xidian University, Xi'an 710071, PR China. Electronic address: xj6417@126.com.
  • Jing Nie
    National Clinical Research Center for Kidney Disease, State Key Laboratory for Organ Failure Research, Division of Nephrology, Nanfang Hospital, Southern Medical University, Guangzhou 510515, Guangdong Province, China.
  • Bonan Ding
    School of Big Data and Software Engineering, Chongqing University, Chongqing, 400044, China.
  • Yuansong Zeng
    School of Computer Science and Engineering, Sun Yat-sen University, Guangzhou 510000, China.
  • Jiale Cao
    School of Electrical and Information Engineering, Tianjin University, Tianjin, China.