DMGAT: predicting ncRNA-drug resistance associations based on diffusion map and heterogeneous graph attention network.

Journal: Briefings in bioinformatics
PMID:

Abstract

Non-coding RNAs (ncRNAs) play crucial roles in drug resistance and sensitivity, making them important biomarkers and therapeutic targets. However, predicting ncRNA-drug associations is challenging due to issues such as dataset imbalance and sparsity, limiting the identification of robust biomarkers. Existing models often fall short in capturing local and global sequence information, limiting the reliability of predictions. This study introduces DMGAT (diffusion map and heterogeneous graph attention network), a novel deep learning model designed to predict ncRNA-drug associations. DMGAT integrates diffusion maps for sequence embedding, graph convolutional networks for feature extraction, and GAT for heterogeneous information fusion. To address dataset imbalance, the model incorporates sensitivity associations and employs a random forest classifier to select reliable negative samples. DMGAT embeds ncRNA sequences and drug SMILES using the word2vec technique, capturing local and global sequence information. The model constructs a heterogeneous network by combining sequence similarity and Gaussian Interaction Profile kernel similarity, providing a comprehensive representation of ncRNA-drug interactions. Evaluated through five-fold cross-validation on a curated dataset from NoncoRNA and ncDR, DMGAT outperforms seven state-of-the-art methods, achieving the highest area under the receiver operating characteristic curve (0.8964), area under the precision-recall curve (0.8984), recall (0.9576), and F1-score (0.8285). The raw data are released to Zenodo with identifier 13929676. The source code of DMGAT is available at https://github.com/liutingyu0616/DMGAT/tree/main.

Authors

  • Tingyu Liu
    School of Mechanical Engineering, Southeast University, Nanjing, 211189, China.
  • Qiuhao Chen
    Zhengzhou Research Institute, Harbin Instituteof Technology, 150000, Nangang District, Xidazhi Street No. 90, Harbin, Heilongjiang, China.
  • Renjie Liu
    Zhengzhou Research Institute, Harbin Instituteof Technology, 150000, Nangang District, Xidazhi Street No. 90, Harbin, Heilongjiang, China.
  • Yuzhi Sun
    School of Computer Science and Technology, Harbin Institute of Technology, 150000, Nangang District, Xidazhi Street No. 90, Harbin, Heilongjiang, China.
  • Yadong Wang
    The Biofoundry, Department of Biomedical Engineering, Cornell University, Ithaca, NY, United States.
  • Yan Zhu
    Department of Chemistry, Xixi Campus, Zhejiang University, Hangzhou, 310028, China. Electronic address: zhuyan@zju.edu.cn.
  • Tianyi Zhao
    Department of Computer Science and Technology, Harbin Institute of Technology, Harbin, 150001, People's Republic of China.