MMF-MCP: A Deep Transfer Learning Model Based on Multimodal Information Fusion for Molecular Feature Extraction and Carcinogenicity Prediction.
Journal:
Journal of Chemical Information and Modeling
Published Date:
Jul 25, 2025
Abstract
Molecular carcinogenicity is a crucial factor in the development of cancer, and predicting it accurately is vital for cancer prevention, treatment, and drug development. In recent years, deep learning has been applied to predict molecular carcinogenicity, but limited data quality and feature richness mean that these methods still need improvement in accuracy, robustness, and interpretability. In this article, we propose MMF-MCP, a deep transfer learning model based on multimodal information fusion, for molecular feature extraction and carcinogenicity prediction. We extract molecular graph features with graph attention networks and fingerprint features with convolutional neural networks, and process molecular images through SE-ResNet18, a deep residual network equipped with a squeeze-and-excitation module. To make better use of the limited carcinogenicity data and to improve the model's predictive performance and generalization ability, we apply a transfer learning strategy: the model is pretrained on a molecular mutagenicity data set and then fine-tuned on the carcinogenicity data set, which transfers knowledge and significantly improves model performance. MMF-MCP achieves average accuracy (ACC), area under the ROC curve (AUC), sensitivity (SE), and specificity (SP) scores of 0.8452, 0.8513, 0.8571, and 0.8333, respectively, on benchmark data sets for molecular carcinogenicity, significantly outperforming state-of-the-art molecular carcinogenicity prediction methods. Additionally, visualizations of MMF-MCP on molecular images demonstrate its strong interpretability and help in visually observing and understanding the critical structures and features of molecular carcinogenicity. The source code for MMF-MCP is available at https://github.com/liuliwei1980/MCP.
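The four reported metrics can be illustrated with a minimal sketch. It assumes the standard binary-classification definitions (ACC as accuracy, SE as sensitivity or true positive rate, SP as specificity or true negative rate, AUC as the area under the ROC curve), with label 1 meaning carcinogenic. The function name `classification_metrics` and the 0.5 decision threshold are hypothetical choices made for illustration; they are not taken from the MMF-MCP source code, and the authors' evaluation protocol may differ.

```python
from typing import Dict, Sequence


def classification_metrics(
    y_true: Sequence[int],
    y_score: Sequence[float],
    threshold: float = 0.5,  # assumed cutoff, not stated in the abstract
) -> Dict[str, float]:
    """Compute ACC, AUC, SE, and SP for binary labels (1 = positive class)."""
    if len(y_true) != len(y_score):
        raise ValueError("y_true and y_score must have the same length")

    # Confusion-matrix counts at the chosen threshold.
    tp = fp = tn = fn = 0
    for label, score in zip(y_true, y_score):
        predicted = 1 if score >= threshold else 0
        if label == 1 and predicted == 1:
            tp += 1
        elif label == 1 and predicted == 0:
            fn += 1
        elif label == 0 and predicted == 1:
            fp += 1
        else:
            tn += 1

    positives = [s for l, s in zip(y_true, y_score) if l == 1]
    negatives = [s for l, s in zip(y_true, y_score) if l == 0]
    if not positives or not negatives:
        raise ValueError("both classes must be present to compute SE, SP, and AUC")

    # AUC as the probability that a random positive outranks a random
    # negative, counting ties as one half (threshold-independent).
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in positives
        for n in negatives
    )

    return {
        "ACC": (tp + tn) / len(y_true),  # fraction of correct predictions
        "AUC": wins / (len(positives) * len(negatives)),
        "SE": tp / (tp + fn),            # sensitivity: recall on positives
        "SP": tn / (tn + fp),            # specificity: recall on negatives
    }
```

Calling `classification_metrics(labels, scores)` on held-out labels and predicted probabilities returns a dictionary keyed by the same abbreviations used in the abstract. The pairwise AUC is quadratic in the number of samples, which is acceptable for the small data sets typical of carcinogenicity benchmarks.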